Overview
Dataset statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Number of variables | 78 | 78 |
| Number of observations | 1000000 | 30000 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Total size in memory | 595.1 MiB | 17.9 MiB |
| Average record size in memory | 624.0 B | 624.0 B |
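The memory figures above are consistent with each of the 78 variables occupying 8 bytes per row. The sketch below reproduces the totals under that assumption; the 8-byte cell width is inferred from the numbers, not stated in the report, and the function names are illustrative.

```python
# Figures are copied from the table above; the 8-byte cell width is an assumption.
N_VARIABLES = 78
BYTES_PER_VALUE = 8  # assumed: one 64-bit number or object pointer per cell
MIB = 1024 ** 2


def record_size_bytes(n_variables: int, bytes_per_value: int = BYTES_PER_VALUE) -> int:
    """Bytes needed for one row if every cell has the same width."""
    return n_variables * bytes_per_value


def total_size_mib(n_rows: int, record_bytes: int) -> float:
    """Total in-memory size in MiB for n_rows rows of record_bytes each."""
    return n_rows * record_bytes / MIB


record = record_size_bytes(N_VARIABLES)
full_mib = total_size_mib(1_000_000, record)
sample_mib = total_size_mib(30_000, record)
print(record, round(full_mib, 1), round(sample_mib, 1))
```

Under this assumption the record size comes out at 624 bytes and the totals round to the 595.1 MiB and 17.9 MiB shown above.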
Variable types
| | Full Dataset | Systematic Sample |
|---|---|---|
| Numeric | 40 | 40 |
| Text | 38 | 38 |
Alerts
| Full Dataset | Systematic Sample | Alert type |
|---|---|---|
| customer_id has unique values | customer_id has unique values | Unique |
| membership_years has 99846 (10.0%) zeros | membership_years has 3047 (10.2%) zeros | Zeros |
| number_of_children has 199753 (20.0%) zeros | number_of_children has 6001 (20.0%) zeros | Zeros |
| transaction_hour has 41756 (4.2%) zeros | transaction_hour has 1312 (4.4%) zeros | Zeros |
| avg_discount_used has 10010 (1.0%) zeros | Alert not present in this dataset | Zeros |
| in_store_purchases has 10016 (1.0%) zeros | in_store_purchases has 317 (1.1%) zeros | Zeros |
| total_returned_items has 100060 (10.0%) zeros | total_returned_items has 3043 (10.1%) zeros | Zeros |
| product_stock has 10174 (1.0%) zeros | Alert not present in this dataset | Zeros |
| customer_support_calls has 49755 (5.0%) zeros | customer_support_calls has 1560 (5.2%) zeros | Zeros |
| website_visits has 10111 (1.0%) zeros | Alert not present in this dataset | Zeros |
| Alert not present in this dataset | discount_applied has 317 (1.1%) zeros | Zeros |
| Alert not present in this dataset | product_return_rate has 315 (1.1%) zeros | Zeros |
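Several Zeros alerts appear for only one of the two datasets, and every such variable has a zero share close to 1%. One reading that fits the listed counts is a cutoff of "more than 1% zeros". The sketch below assumes that cutoff; the threshold value and the helper names are assumptions, not something the report states.

```python
ZERO_ALERT_THRESHOLD = 0.01  # assumed cutoff; not stated in the report


def zero_share(zeros: int, n_rows: int) -> float:
    """Fraction of rows whose value is zero."""
    return zeros / n_rows


def raises_zero_alert(zeros: int, n_rows: int,
                      threshold: float = ZERO_ALERT_THRESHOLD) -> bool:
    """True when the zero share strictly exceeds the assumed threshold."""
    return zero_share(zeros, n_rows) > threshold


# Zero counts copied from the alerts listed above.
full_counts = {"avg_discount_used": 10010, "in_store_purchases": 10016,
               "product_stock": 10174, "website_visits": 10111}
sample_counts = {"in_store_purchases": 317, "discount_applied": 317,
                 "product_return_rate": 315}

for name, zeros in full_counts.items():
    print("full", name, raises_zero_alert(zeros, 1_000_000))
for name, zeros in sample_counts.items():
    print("sample", name, raises_zero_alert(zeros, 30_000))
```

With this cutoff, a variable in the 30000-row sample raises the alert only when it has more than 300 zeros, so a column sitting just above 1% in the full dataset can fall below the line in the sample, and the reverse.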
Reproduction
| | Full Dataset | Systematic Sample |
|---|---|---|
| Analysis started | 2025-06-06 02:05:49.172463 | 2025-06-06 02:07:41.655463 |
| Analysis finished | 2025-06-06 02:07:41.630050 | 2025-06-06 02:07:45.980651 |
| Duration | 1 minute and 52.46 seconds | 4.33 seconds |
| Software version | ydata-profiling v4.16.1 | ydata-profiling v4.16.1 |
| Download configuration | config.json | config.json |
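The Duration row can be recomputed from the start and finish timestamps. The following is a minimal sketch using only the standard library; the function name is illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"


def duration_seconds(started: str, finished: str) -> float:
    """Elapsed seconds between two timestamps in the report's format."""
    delta = datetime.strptime(finished, FMT) - datetime.strptime(started, FMT)
    return delta.total_seconds()


full = duration_seconds("2025-06-06 02:05:49.172463", "2025-06-06 02:07:41.630050")
sample = duration_seconds("2025-06-06 02:07:41.655463", "2025-06-06 02:07:45.980651")

minutes, seconds = divmod(full, 60)
print(int(minutes), round(seconds, 2), round(sample, 2))
```

The full run works out to 1 minute and about 52.46 seconds, and the sample run to about 4.33 seconds, matching the table.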
Variables
customer_id
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 1000000 | 30000 |
| Distinct (%) | 100.0% | 100.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500000.5 | 494984.5 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 1000000 | 989968 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 50000.95 | 49499.35 |
| Q1 | 250000.75 | 247492.75 |
| median | 500000.5 | 494984.5 |
| Q3 | 750000.25 | 742476.25 |
| 95-th percentile | 950000.05 | 940469.65 |
| Maximum | 1000000 | 989968 |
| Range | 999999 | 989967 |
| Interquartile range (IQR) | 499999.5 | 494983.5 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 288675.2789 | 285793.1463 |
| Coefficient of variation (CV) | 0.5773499805 | 0.5773779711 |
| Kurtosis | -1.2 | -1.2 |
| Mean | 500000.5 | 494984.5 |
| Median Absolute Deviation (MAD) | 250000 | 247500 |
| Skewness | -2.511790261 × 10⁻¹⁵ | 0 |
| Sum | 5.000005 × 10¹¹ | 1.4849535 × 10¹⁰ |
| Variance | 8.333341667 × 10¹⁰ | 8.16777225 × 10¹⁰ |
| Monotonicity | Strictly increasing | Strictly increasing |
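Because customer_id is strictly increasing with every value distinct, its variance can be checked against the closed form for equally spaced values. The sketch below assumes the full column is 1, 2, ..., 1000000 and the sample is 30000 values spaced 33 apart; both are inferred from the minimum, maximum and listed values, not stated outright. It also assumes the report uses the sample variance (divisor n - 1).

```python
import math
import statistics


def progression_sample_variance(n: int, step: int = 1) -> float:
    """Sample variance (divisor n - 1) of n equally spaced values.

    For values a, a + step, ..., a + (n - 1) * step this equals
    step**2 * n * (n + 1) / 12, independent of the starting value a.
    """
    return step ** 2 * n * (n + 1) / 12


full_var = progression_sample_variance(1_000_000)
sample_var = progression_sample_variance(30_000, step=33)
full_std = math.sqrt(full_var)

# Cross-check the closed form against the standard library on a small case.
small = list(range(1, 101))
assert math.isclose(statistics.variance(small), progression_sample_variance(100))

print(full_var, sample_var, full_std)
```

Under these assumptions the closed form gives about 8.3333 × 10¹⁰ for the full dataset and 8.16777225 × 10¹⁰ for the sample, in line with the Variance row.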
| Value | Count | Frequency (%) |
| 999984 | 1 | < 0.1% |
| 999983 | 1 | < 0.1% |
| 999982 | 1 | < 0.1% |
| 999981 | 1 | < 0.1% |
| 999980 | 1 | < 0.1% |
| 999979 | 1 | < 0.1% |
| 999978 | 1 | < 0.1% |
| 999977 | 1 | < 0.1% |
| 999976 | 1 | < 0.1% |
| 999975 | 1 | < 0.1% |
| Other values (999990) | 999990 |
| Value | Count | Frequency (%) |
| 989440 | 1 | < 0.1% |
| 989407 | 1 | < 0.1% |
| 989374 | 1 | < 0.1% |
| 989341 | 1 | < 0.1% |
| 989308 | 1 | < 0.1% |
| 989275 | 1 | < 0.1% |
| 989242 | 1 | < 0.1% |
| 989209 | 1 | < 0.1% |
| 989176 | 1 | < 0.1% |
| 989143 | 1 | < 0.1% |
| Other values (29990) | 29990 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 34 | 1 | |
| 67 | 1 | |
| 100 | 1 | |
| 133 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 34 | 1 | |
| 67 | 1 | |
| 100 | 1 | |
| 133 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
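The smallest sample values listed for customer_id (1, 34, 67, 100, 133) suggest that the systematic sample keeps every 33rd row starting from the first and stops at 30000 rows. The sketch below reproduces the sample's minimum, maximum, mean and sum under that assumption; the step, the starting row and the function name are inferred for illustration and are not given in the report.

```python
def systematic_sample(population, step: int, size: int) -> list:
    """Keep every `step`-th item starting at the first, at most `size` items."""
    return list(population[::step][:size])


customer_ids = range(1, 1_000_001)  # assumed: ids run from 1 to 1000000
sample_ids = systematic_sample(customer_ids, step=33, size=30_000)

print(sample_ids[:5])
print(sample_ids[-1], sum(sample_ids), sum(sample_ids) / len(sample_ids))
```

A step of 33 is the integer part of 1000000 / 30000, which would explain why the sample's maximum (989968) stops short of the full dataset's maximum.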
age
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 62 | 62 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 48.496605 | 48.61583333 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 18 | 18 |
| Maximum | 79 | 79 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 18 | 18 |
| 5-th percentile | 21 | 21 |
| Q1 | 33 | 33 |
| median | 49 | 49 |
| Q3 | 64 | 64 |
| 95-th percentile | 76 | 76 |
| Maximum | 79 | 79 |
| Range | 61 | 61 |
| Interquartile range (IQR) | 31 | 31 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 17.87438116 | 17.81809657 |
| Coefficient of variation (CV) | 0.3685697414 | 0.3665080973 |
| Kurtosis | -1.198117884 | -1.189797483 |
| Mean | 48.496605 | 48.61583333 |
| Median Absolute Deviation (MAD) | 15 | 15 |
| Skewness | -0.0002769945754 | -0.005388281958 |
| Sum | 48496605 | 1458475 |
| Variance | 319.493502 | 317.4845655 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 53 | 16423 | 1.6% |
| 54 | 16412 | 1.6% |
| 33 | 16407 | 1.6% |
| 36 | 16363 | 1.6% |
| 62 | 16324 | 1.6% |
| 39 | 16290 | 1.6% |
| 34 | 16284 | 1.6% |
| 40 | 16274 | 1.6% |
| 32 | 16264 | 1.6% |
| 19 | 16248 | 1.6% |
| Other values (52) | 836711 |
| Value | Count | Frequency (%) |
| 47 | 532 | 1.8% |
| 37 | 524 | 1.7% |
| 42 | 520 | 1.7% |
| 32 | 519 | 1.7% |
| 79 | 517 | 1.7% |
| 61 | 516 | 1.7% |
| 53 | 516 | 1.7% |
| 68 | 508 | 1.7% |
| 33 | 507 | 1.7% |
| 40 | 506 | 1.7% |
| Other values (52) | 24835 |
| Value | Count | Frequency (%) |
| 18 | 16003 | |
| 19 | 16248 | |
| 20 | 16116 | |
| 21 | 16016 | |
| 22 | 16211 |
| Value | Count | Frequency (%) |
| 18 | 479 | |
| 19 | 484 | |
| 20 | 462 | |
| 21 | 467 | |
| 22 | 461 |
| Value | Count | Frequency (%) |
| 18 | 479 | |
| 19 | 484 | |
| 20 | 462 | |
| 21 | 467 | |
| 22 | 461 |
| Value | Count | Frequency (%) |
| 18 | 16003 | |
| 19 | 16248 | |
| 20 | 16116 | |
| 21 | 16016 | |
| 22 | 16211 |
gender
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 5 | 5 |
| Mean length | 5.001174 | 5.0023 |
| Min length | 4 | 4 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Other | Other |
| 2nd row | Female | Female |
| 3rd row | Female | Other |
| 4th row | Female | Male |
| 5th row | Female | Male |
| Value | Count | Frequency (%) |
| other | 333734 | |
| female | 333720 | |
| male | 332546 |
| Value | Count | Frequency (%) |
| other | 10119 | |
| female | 9975 | |
| male | 9906 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39975 | |
| a | 19881 | |
| l | 19881 | |
| O | 10119 | 6.7% |
| t | 10119 | 6.7% |
| h | 10119 | 6.7% |
| r | 10119 | 6.7% |
| F | 9975 | 6.6% |
| m | 9975 | 6.6% |
| M | 9906 | 6.6% |
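The character counts above follow directly from the value counts: each category contributes its letters once per occurrence. The sketch below rebuilds the full-dataset character table from the three gender counts, assuming the original spellings are "Other", "Female" and "Male" as shown in the Sample rows; the function name is illustrative.

```python
from collections import Counter


def character_counts(value_counts: dict) -> Counter:
    """Total occurrences of each character across all rows."""
    totals = Counter()
    for value, count in value_counts.items():
        for char, per_value in Counter(value).items():
            totals[char] += per_value * count
    return totals


# Value counts for the full dataset, copied from the table above.
full_gender = {"Other": 333734, "Female": 333720, "Male": 332546}

chars = character_counts(full_gender)
total_chars = sum(chars.values())
mean_length = total_chars / sum(full_gender.values())

print(chars.most_common(3), total_chars, mean_length)
```

The total number of characters divided by the number of rows gives the Mean length reported for the variable, and each character's share of that total gives its Frequency (%).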
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 150069 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39975 | |
| a | 19881 | |
| l | 19881 | |
| O | 10119 | 6.7% |
| t | 10119 | 6.7% |
| h | 10119 | 6.7% |
| r | 10119 | 6.7% |
| F | 9975 | 6.6% |
| m | 9975 | 6.6% |
| M | 9906 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 150069 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39975 | |
| a | 19881 | |
| l | 19881 | |
| O | 10119 | 6.7% |
| t | 10119 | 6.7% |
| h | 10119 | 6.7% |
| r | 10119 | 6.7% |
| F | 9975 | 6.6% |
| m | 9975 | 6.6% |
| M | 9906 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5001174 |
| Value | Count | Frequency (%) |
| (unknown) | 150069 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1333720 | |
| a | 666266 | |
| l | 666266 | |
| O | 333734 | 6.7% |
| t | 333734 | 6.7% |
| h | 333734 | 6.7% |
| r | 333734 | 6.7% |
| F | 333720 | 6.7% |
| m | 333720 | 6.7% |
| M | 332546 | 6.6% |
| Value | Count | Frequency (%) |
| e | 39975 | |
| a | 19881 | |
| l | 19881 | |
| O | 10119 | 6.7% |
| t | 10119 | 6.7% |
| h | 10119 | 6.7% |
| r | 10119 | 6.7% |
| F | 9975 | 6.6% |
| m | 9975 | 6.6% |
| M | 9906 | 6.6% |
income_bracket
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.333713 | 4.335333333 |
| Min length | 3 | 3 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | High | High |
| 2nd row | Medium | High |
| 3rd row | Low | High |
| 4th row | Low | Low |
| 5th row | Low | Low |
| Value | Count | Frequency (%) |
| high | 333612 | |
| medium | 333367 | |
| low | 333021 |
| Value | Count | Frequency (%) |
| high | 10060 | |
| medium | 10000 | |
| low | 9940 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20060 | |
| H | 10060 | |
| g | 10060 | |
| h | 10060 | |
| M | 10000 | |
| e | 10000 | |
| d | 10000 | |
| u | 10000 | |
| m | 10000 | |
| L | 9940 | |
| Other values (2) | 19880 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130060 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20060 | |
| H | 10060 | |
| g | 10060 | |
| h | 10060 | |
| M | 10000 | |
| e | 10000 | |
| d | 10000 | |
| u | 10000 | |
| m | 10000 | |
| L | 9940 | |
| Other values (2) | 19880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130060 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20060 | |
| H | 10060 | |
| g | 10060 | |
| h | 10060 | |
| M | 10000 | |
| e | 10000 | |
| d | 10000 | |
| u | 10000 | |
| m | 10000 | |
| L | 9940 | |
| Other values (2) | 19880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4333713 |
| Value | Count | Frequency (%) |
| (unknown) | 130060 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 666979 | |
| H | 333612 | |
| g | 333612 | |
| h | 333612 | |
| M | 333367 | |
| e | 333367 | |
| d | 333367 | |
| u | 333367 | |
| m | 333367 | |
| L | 333021 | |
| Other values (2) | 666042 |
| Value | Count | Frequency (%) |
| i | 20060 | |
| H | 10060 | |
| g | 10060 | |
| h | 10060 | |
| M | 10000 | |
| e | 10000 | |
| d | 10000 | |
| u | 10000 | |
| m | 10000 | |
| L | 9940 | |
| Other values (2) | 19880 |
loyalty_program
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499712 | 2.498133333 |
| Min length | 2 | 2 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | Yes |
| 3rd row | No | Yes |
| 4th row | No | Yes |
| 5th row | Yes | Yes |
| Value | Count | Frequency (%) |
| no | 500288 | |
| yes | 499712 |
| Value | Count | Frequency (%) |
| no | 15056 | |
| yes | 14944 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15056 | |
| o | 15056 | |
| Y | 14944 | |
| e | 14944 | |
| s | 14944 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74944 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15056 | |
| o | 15056 | |
| Y | 14944 | |
| e | 14944 | |
| s | 14944 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74944 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15056 | |
| o | 15056 | |
| Y | 14944 | |
| e | 14944 | |
| s | 14944 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499712 |
| Value | Count | Frequency (%) |
| (unknown) | 74944 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500288 | |
| o | 500288 | |
| Y | 499712 | |
| e | 499712 | |
| s | 499712 |
| Value | Count | Frequency (%) |
| N | 15056 | |
| o | 15056 | |
| Y | 14944 | |
| e | 14944 | |
| s | 14944 |
membership_years
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 10 | 10 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4.497453 | 4.486833333 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 9 | 9 |
| Zeros | 99846 | 3047 |
| Zeros (%) | 10.0% | 10.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 2 | 2 |
| median | 4 | 4 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 5 | 5 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 2.872405571 | 2.872809613 |
| Coefficient of variation (CV) | 0.6386738385 | 0.640275535 |
| Kurtosis | -1.22454665 | -1.214547006 |
| Mean | 4.497453 | 4.486833333 |
| Median Absolute Deviation (MAD) | 3 | 2 |
| Skewness | 0.001590463324 | 0.007875075606 |
| Sum | 4497453 | 134605 |
| Variance | 8.250713764 | 8.253035073 |
| Monotonicity | Not monotonic | Not monotonic |
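The mean, variance and kurtosis above sit close to what a discrete uniform distribution on the integers 0 to 9 would give. The sketch below computes those reference values from the standard closed forms; treating the column as exactly uniform is an assumption, and the formulas use the population variance, which differs negligibly from the sample variance at these row counts.

```python
def discrete_uniform_moments(low: int, high: int) -> tuple:
    """Mean, population variance and excess kurtosis of a uniform
    distribution on the integers low..high (inclusive)."""
    n = high - low + 1
    mean = (low + high) / 2
    variance = (n ** 2 - 1) / 12
    excess_kurtosis = -6 * (n ** 2 + 1) / (5 * (n ** 2 - 1))
    return mean, variance, excess_kurtosis


mean, variance, kurtosis = discrete_uniform_moments(0, 9)
print(mean, variance, round(kurtosis, 4))
```

For 0 to 9 this gives a mean of 4.5, a variance of 8.25 and an excess kurtosis of about -1.2242, against the reported 4.497, 8.2507 and -1.2245 for the full dataset.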
| Value | Count | Frequency (%) |
| 1 | 100686 | |
| 5 | 100183 | |
| 4 | 100137 | |
| 9 | 99977 | |
| 2 | 99964 | |
| 8 | 99891 | |
| 6 | 99865 | |
| 0 | 99846 | |
| 7 | 99728 | |
| 3 | 99723 |
| Value | Count | Frequency (%) |
| 9 | 3065 | |
| 5 | 3057 | |
| 0 | 3047 | |
| 4 | 3044 | |
| 6 | 3042 | |
| 1 | 2991 | |
| 2 | 2986 | |
| 3 | 2977 | |
| 7 | 2915 | |
| 8 | 2876 |
| Value | Count | Frequency (%) |
| 0 | 99846 | |
| 1 | 100686 | |
| 2 | 99964 | |
| 3 | 99723 | |
| 4 | 100137 |
| Value | Count | Frequency (%) |
| 0 | 3047 | |
| 1 | 2991 | |
| 2 | 2986 | |
| 3 | 2977 | |
| 4 | 3044 |
| Value | Count | Frequency (%) |
| 0 | 3047 | |
| 1 | 2991 | |
| 2 | 2986 | |
| 3 | 2977 | |
| 4 | 3044 |
| Value | Count | Frequency (%) |
| 0 | 99846 | |
| 1 | 100686 | |
| 2 | 99964 | |
| 3 | 99723 | |
| 4 | 100137 |
churned
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 2 |
| Mean length | 2.499729 | 2.4964 |
| Min length | 2 | 2 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | Yes |
| 3rd row | No | Yes |
| 4th row | No | Yes |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| no | 500271 | |
| yes | 499729 |
| Value | Count | Frequency (%) |
| no | 15108 | |
| yes | 14892 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15108 | |
| o | 15108 | |
| Y | 14892 | |
| e | 14892 | |
| s | 14892 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74892 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15108 | |
| o | 15108 | |
| Y | 14892 | |
| e | 14892 | |
| s | 14892 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74892 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15108 | |
| o | 15108 | |
| Y | 14892 | |
| e | 14892 | |
| s | 14892 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499729 |
| Value | Count | Frequency (%) |
| (unknown) | 74892 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500271 | |
| o | 500271 | |
| Y | 499729 | |
| e | 499729 | |
| s | 499729 |
| Value | Count | Frequency (%) |
| N | 15108 | |
| o | 15108 | |
| Y | 14892 | |
| e | 14892 | |
| s | 14892 |
marital_status
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 8 | 8 |
| Median length | 7 | 7 |
| Mean length | 7.000866 | 7.0019 |
| Min length | 6 | 6 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Divorced | Divorced |
| 2nd row | Married | Divorced |
| 3rd row | Married | Married |
| 4th row | Divorced | Married |
| 5th row | Divorced | Single |
| Value | Count | Frequency (%) |
| divorced | 333816 | |
| married | 333234 | |
| single | 332950 |
| Value | Count | Frequency (%) |
| married | 10085 | |
| divorced | 9986 | |
| single | 9929 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30156 | |
| i | 30000 | |
| e | 30000 | |
| d | 20071 | |
| a | 10085 | 4.8% |
| M | 10085 | 4.8% |
| D | 9986 | 4.8% |
| v | 9986 | 4.8% |
| o | 9986 | 4.8% |
| c | 9986 | 4.8% |
| Other values (4) | 39716 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210057 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30156 | |
| i | 30000 | |
| e | 30000 | |
| d | 20071 | |
| a | 10085 | 4.8% |
| M | 10085 | 4.8% |
| D | 9986 | 4.8% |
| v | 9986 | 4.8% |
| o | 9986 | 4.8% |
| c | 9986 | 4.8% |
| Other values (4) | 39716 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210057 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30156 | |
| i | 30000 | |
| e | 30000 | |
| d | 20071 | |
| a | 10085 | 4.8% |
| M | 10085 | 4.8% |
| D | 9986 | 4.8% |
| v | 9986 | 4.8% |
| o | 9986 | 4.8% |
| c | 9986 | 4.8% |
| Other values (4) | 39716 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000866 |
| Value | Count | Frequency (%) |
| (unknown) | 210057 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 1000284 | |
| i | 1000000 | |
| e | 1000000 | |
| d | 667050 | |
| D | 333816 | 4.8% |
| v | 333816 | 4.8% |
| c | 333816 | 4.8% |
| o | 333816 | 4.8% |
| M | 333234 | 4.8% |
| a | 333234 | 4.8% |
| Other values (4) | 1331800 |
| Value | Count | Frequency (%) |
| r | 30156 | |
| i | 30000 | |
| e | 30000 | |
| d | 20071 | |
| a | 10085 | 4.8% |
| M | 10085 | 4.8% |
| D | 9986 | 4.8% |
| v | 9986 | 4.8% |
| o | 9986 | 4.8% |
| c | 9986 | 4.8% |
| Other values (4) | 39716 |
number_of_children
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 2.000554 | 2.0023 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 4 | 4 |
| Zeros | 199753 | 6001 |
| Zeros (%) | 20.0% | 20.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 1 | 1 |
| median | 2 | 2 |
| Q3 | 3 | 3 |
| 95-th percentile | 4 | 4 |
| Maximum | 4 | 4 |
| Range | 4 | 4 |
| Interquartile range (IQR) | 2 | 2 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 1.414214161 | 1.4163668 |
| Coefficient of variation (CV) | 0.7069112661 | 0.7073699248 |
| Kurtosis | -1.300270709 | -1.302969252 |
| Mean | 2.000554 | 2.0023 |
| Median Absolute Deviation (MAD) | 1 | 1 |
| Skewness | -0.0001223295646 | -0.0001202378719 |
| Sum | 2000554 | 60069 |
| Variance | 2.000001693 | 2.006094913 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 200307 | |
| 4 | 200157 | |
| 3 | 200053 | |
| 0 | 199753 | |
| 2 | 199730 |
| Value | Count | Frequency (%) |
| 4 | 6057 | |
| 0 | 6001 | |
| 1 | 5996 | |
| 2 | 5993 | |
| 3 | 5953 |
| Value | Count | Frequency (%) |
| 0 | 199753 | |
| 1 | 200307 | |
| 2 | 199730 | |
| 3 | 200053 | |
| 4 | 200157 |
| Value | Count | Frequency (%) |
| 0 | 6001 | |
| 1 | 5996 | |
| 2 | 5993 | |
| 3 | 5953 | |
| 4 | 6057 |
| Value | Count | Frequency (%) |
| 0 | 6001 | |
| 1 | 5996 | |
| 2 | 5993 | |
| 3 | 5953 | |
| 4 | 6057 |
| Value | Count | Frequency (%) |
| 0 | 199753 | |
| 1 | 200307 | |
| 2 | 199730 | |
| 3 | 200053 | |
| 4 | 200157 |
education_level
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 10 | 10 |
| Mean length | 8.00064 | 8.0221 |
| Min length | 3 | 3 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Bachelor's | Bachelor's |
| 2nd row | PhD | Master's |
| 3rd row | Bachelor's | PhD |
| 4th row | Master's | High School |
| 5th row | Bachelor's | PhD |
| Value | Count | Frequency (%) |
| bachelor's | 250360 | |
| high | 250105 | |
| school | 250105 | |
| phd | 250079 | |
| master's | 249456 |
| Value | Count | Frequency (%) |
| bachelor's | 7633 | |
| master's | 7504 | |
| high | 7464 | |
| school | 7464 | |
| phd | 7399 |
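The word table above lists "high" and "school" as separate entries with identical counts, which suggests the values are lowercased and split on whitespace before counting. The sketch below reproduces the full-dataset table under that assumption; the actual tokenizer is not shown in the report, and the function name is illustrative.

```python
from collections import Counter


def word_counts(value_counts: dict) -> Counter:
    """Count lowercase whitespace-separated words across all rows."""
    words = Counter()
    for value, count in value_counts.items():
        for word in value.lower().split():
            words[word] += count
    return words


# Category counts for the full dataset, inferred from the word table above.
full_education = {"Bachelor's": 250360, "High School": 250105,
                  "PhD": 250079, "Master's": 249456}

words = word_counts(full_education)
print(words.most_common())
```

Because "High School" contributes two words per row, the word counts sum to more than the number of observations.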
Most occurring characters
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 29960 | |
| s | 22641 | 9.4% |
| o | 22561 | 9.4% |
| ' | 15137 | 6.3% |
| r | 15137 | 6.3% |
| a | 15137 | 6.3% |
| e | 15137 | 6.3% |
| l | 15097 | 6.3% |
| c | 15097 | 6.3% |
| B | 7633 | 3.2% |
| Other values (9) | 67126 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240663 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 29960 | |
| s | 22641 | 9.4% |
| o | 22561 | 9.4% |
| ' | 15137 | 6.3% |
| r | 15137 | 6.3% |
| a | 15137 | 6.3% |
| e | 15137 | 6.3% |
| l | 15097 | 6.3% |
| c | 15097 | 6.3% |
| B | 7633 | 3.2% |
| Other values (9) | 67126 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240663 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 29960 | |
| s | 22641 | 9.4% |
| o | 22561 | 9.4% |
| ' | 15137 | 6.3% |
| r | 15137 | 6.3% |
| a | 15137 | 6.3% |
| e | 15137 | 6.3% |
| l | 15097 | 6.3% |
| c | 15097 | 6.3% |
| B | 7633 | 3.2% |
| Other values (9) | 67126 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8000640 |
| Value | Count | Frequency (%) |
| (unknown) | 240663 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 1000649 | |
| o | 750570 | 9.4% |
| s | 749272 | 9.4% |
| c | 500465 | 6.3% |
| l | 500465 | 6.3% |
| e | 499816 | 6.2% |
| a | 499816 | 6.2% |
| ' | 499816 | 6.2% |
| r | 499816 | 6.2% |
| B | 250360 | 3.1% |
| Other values (9) | 2249595 |
| Value | Count | Frequency (%) |
| h | 29960 | |
| s | 22641 | 9.4% |
| o | 22561 | 9.4% |
| ' | 15137 | 6.3% |
| r | 15137 | 6.3% |
| a | 15137 | 6.3% |
| e | 15137 | 6.3% |
| l | 15097 | 6.3% |
| c | 15097 | 6.3% |
| B | 7633 | 3.2% |
| Other values (9) | 67126 |
occupation
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 10 | 10 |
| Mean length | 9.500854 | 9.512833333 |
| Min length | 7 | 7 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Self-Employed | Self-Employed |
| 2nd row | Unemployed | Unemployed |
| 3rd row | Self-Employed | Employed |
| 4th row | Employed | Unemployed |
| 5th row | Employed | Employed |
| Value | Count | Frequency (%) |
| employed | 250857 | |
| unemployed | 250117 | |
| self-employed | 249941 | |
| retired | 249085 |
| Value | Count | Frequency (%) |
| self-employed | 7629 | |
| employed | 7552 | |
| retired | 7466 | |
| unemployed | 7353 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52448 | |
| l | 30163 | |
| d | 30000 | |
| o | 22534 | |
| p | 22534 | |
| y | 22534 | |
| m | 22534 | |
| E | 15181 | 5.3% |
| S | 7629 | 2.7% |
| f | 7629 | 2.7% |
| Other values (7) | 52199 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 285385 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52448 | |
| l | 30163 | |
| d | 30000 | |
| o | 22534 | |
| p | 22534 | |
| y | 22534 | |
| m | 22534 | |
| E | 15181 | 5.3% |
| S | 7629 | 2.7% |
| f | 7629 | 2.7% |
| Other values (7) | 52199 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 285385 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52448 | |
| l | 30163 | |
| d | 30000 | |
| o | 22534 | |
| p | 22534 | |
| y | 22534 | |
| m | 22534 | |
| E | 15181 | 5.3% |
| S | 7629 | 2.7% |
| f | 7629 | 2.7% |
| Other values (7) | 52199 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9500854 |
| Value | Count | Frequency (%) |
| (unknown) | 285385 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1749143 | |
| l | 1000856 | |
| d | 1000000 | |
| o | 750915 | |
| m | 750915 | |
| y | 750915 | |
| p | 750915 | |
| E | 500798 | 5.3% |
| U | 250117 | 2.6% |
| n | 250117 | 2.6% |
| Other values (7) | 1746163 |
| Value | Count | Frequency (%) |
| e | 52448 | |
| l | 30163 | |
| d | 30000 | |
| o | 22534 | |
| p | 22534 | |
| y | 22534 | |
| m | 22534 | |
| E | 15181 | 5.3% |
| S | 7629 | 2.7% |
| f | 7629 | 2.7% |
| Other values (7) | 52199 |
transaction_id
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 632576 | 29558 |
| Distinct (%) | 63.3% | 98.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499891.7314 | 501922.5861 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 2 | 31 |
| Maximum | 999999 | 999999 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 2 | 31 |
| 5-th percentile | 50200.95 | 50175.45 |
| Q1 | 249878.75 | 250889.75 |
| median | 499559.5 | 502763.5 |
| Q3 | 750071.25 | 752751.25 |
| 95-th percentile | 950045.2 | 950656.2 |
| Maximum | 999999 | 999999 |
| Range | 999997 | 999968 |
| Interquartile range (IQR) | 500192.5 | 501861.5 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288706.0577 | 289037.1838 |
| Coefficient of variation (CV) | 0.5775371735 | 0.5758600865 |
| Kurtosis | -1.200114605 | -1.203309752 |
| Mean | 499891.7314 | 501922.5861 |
| Median Absolute Deviation (MAD) | 250088.5 | 250939 |
| Skewness | 0.002395187253 | -0.008982096119 |
| Sum | 4.998917314 × 10¹¹ | 1.505767758 × 10¹⁰ |
| Variance | 8.335118772 × 10¹⁰ | 8.354249365 × 10¹⁰ |
| Monotonicity | Not monotonic | Not monotonic |
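The derived rows above (Range, IQR, CV, Variance) follow from the primary statistics by simple arithmetic. The sketch below re-derives them as a quick consistency check. It assumes the values are copied by hand from the transaction_id "Full Dataset" column, and the helper name `derived_stats` is purely illustrative, not part of ydata-profiling.

```python
import math

# Values copied from the transaction_id "Full Dataset" column of this report.
reported = {
    "min": 2, "max": 999999,
    "q1": 249878.75, "q3": 750071.25,
    "mean": 499891.7314, "std": 288706.0577,
    "range": 999997, "iqr": 500192.5,
    "cv": 0.5775371735, "variance": 8.335118772e10,
}

def derived_stats(r):
    """Recompute the derived statistics from the primary ones (illustrative helper)."""
    return {
        "range": r["max"] - r["min"],    # Range = Maximum - Minimum
        "iqr": r["q3"] - r["q1"],        # IQR = Q3 - Q1
        "cv": r["std"] / r["mean"],      # CV = standard deviation / mean
        "variance": r["std"] ** 2,       # Variance = standard deviation squared
    }

derived = derived_stats(reported)
for name, value in derived.items():
    # The tolerance absorbs rounding in the printed report values.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.10g} reported={reported[name]:.10g} match={ok}")
```

The same check can be run against the "Systematic Sample" column, or against any other numeric variable in the report, by swapping in that column's values.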
| Value | Count | Frequency (%) |
| 115913 | 9 | < 0.1% |
| 504562 | 8 | < 0.1% |
| 344167 | 8 | < 0.1% |
| 2773 | 8 | < 0.1% |
| 239407 | 8 | < 0.1% |
| 620816 | 8 | < 0.1% |
| 273197 | 8 | < 0.1% |
| 254678 | 7 | < 0.1% |
| 798940 | 7 | < 0.1% |
| 335691 | 7 | < 0.1% |
| Other values (632566) | 999922 |
| Value | Count | Frequency (%) |
| 295619 | 3 | < 0.1% |
| 875137 | 3 | < 0.1% |
| 101418 | 3 | < 0.1% |
| 620816 | 3 | < 0.1% |
| 44700 | 3 | < 0.1% |
| 288436 | 3 | < 0.1% |
| 258166 | 3 | < 0.1% |
| 257013 | 2 | < 0.1% |
| 784191 | 2 | < 0.1% |
| 495427 | 2 | < 0.1% |
| Other values (29548) | 29973 |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 1 | < 0.1% |
| 7 | 2 |
| Value | Count | Frequency (%) |
| 31 | 1 | |
| 67 | 1 | |
| 111 | 1 | |
| 187 | 1 | |
| 214 | 1 |
| Value | Count | Frequency (%) |
| 31 | 1 | |
| 67 | 1 | |
| 111 | 1 | |
| 187 | 1 | |
| 214 | 1 |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 1 | < 0.1% |
| 7 | 2 |
transaction_date
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 992231 | 29990 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 984504 | 29980 |
| Unique (%) | 98.5% | 99.9% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | 2020-10-11 10:08:52 | 2020-10-11 10:08:52 |
| 2nd row | 2021-12-08 01:07:40 | 2020-05-08 05:45:31 |
| 3rd row | 2020-02-17 09:40:48 | 2021-01-06 13:38:44 |
| 4th row | 2020-08-13 00:43:14 | 2021-04-22 18:40:10 |
| 5th row | 2021-07-02 11:59:03 | 2020-04-04 04:51:10 |
| Value | Count | Frequency (%) |
| 2020-10-05 | 1509 | 0.1% |
| 2020-09-06 | 1467 | 0.1% |
| 2020-10-04 | 1464 | 0.1% |
| 2020-07-26 | 1463 | 0.1% |
| 2020-02-26 | 1458 | 0.1% |
| 2020-05-03 | 1455 | 0.1% |
| 2021-02-27 | 1453 | 0.1% |
| 2021-07-30 | 1451 | 0.1% |
| 2020-09-07 | 1451 | 0.1% |
| 2020-10-09 | 1447 | 0.1% |
| Other values (87119) | 1985382 |
| Value | Count | Frequency (%) |
| 2020-10-23 | 60 | 0.1% |
| 2020-09-16 | 57 | 0.1% |
| 2021-03-12 | 57 | 0.1% |
| 2020-12-22 | 57 | 0.1% |
| 2021-08-18 | 57 | 0.1% |
| 2020-02-06 | 55 | 0.1% |
| 2021-04-26 | 55 | 0.1% |
| 2020-04-22 | 55 | 0.1% |
| 2021-08-13 | 55 | 0.1% |
| 2020-08-01 | 55 | 0.1% |
| Other values (26116) | 59437 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 114027 | |
| 2 | 102499 | |
| 1 | 73048 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26567 | 4.7% |
| 5 | 23954 | 4.2% |
| 4 | 23933 | 4.2% |
| 8 | 14066 | 2.5% |
| Other values (3) | 41906 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 114027 | |
| 2 | 102499 | |
| 1 | 73048 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26567 | 4.7% |
| 5 | 23954 | 4.2% |
| 4 | 23933 | 4.2% |
| 8 | 14066 | 2.5% |
| Other values (3) | 41906 | 7.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 114027 | |
| 2 | 102499 | |
| 1 | 73048 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26567 | 4.7% |
| 5 | 23954 | 4.2% |
| 4 | 23933 | 4.2% |
| 8 | 14066 | 2.5% |
| Other values (3) | 41906 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3798683 | |
| 2 | 3414072 | |
| 1 | 2439659 | |
| : | 2000000 | |
| - | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 890067 | 4.7% |
| 5 | 800311 | 4.2% |
| 4 | 798703 | 4.2% |
| 7 | 467556 | 2.5% |
| Other values (3) | 1390949 | 7.3% |
| Value | Count | Frequency (%) |
| 0 | 114027 | |
| 2 | 102499 | |
| 1 | 73048 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26567 | 4.7% |
| 5 | 23954 | 4.2% |
| 4 | 23933 | 4.2% |
| 8 | 14066 | 2.5% |
| Other values (3) | 41906 | 7.4% |
product_id
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 9999 | 9507 |
| Distinct (%) | 1.0% | 31.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4999.564515 | 5014.245233 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 9999 | 9999 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 500 | 512 |
| Q1 | 2498 | 2507.75 |
| median | 4999 | 5016 |
| Q3 | 7498 | 7532.25 |
| 95-th percentile | 9499 | 9500 |
| Maximum | 9999 | 9999 |
| Range | 9998 | 9998 |
| Interquartile range (IQR) | 5000 | 5024.5 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2886.798391 | 2890.88568 |
| Coefficient of variation (CV) | 0.5774099689 | 0.5765345621 |
| Kurtosis | -1.200144352 | -1.207110001 |
| Mean | 4999.564515 | 5014.245233 |
| Median Absolute Deviation (MAD) | 2500 | 2511 |
| Skewness | 0.0002346107222 | -0.004886028745 |
| Sum | 4999564515 | 150427357 |
| Variance | 8333604.95 | 8357220.013 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4898 | 145 | < 0.1% |
| 51 | 143 | < 0.1% |
| 9593 | 141 | < 0.1% |
| 5427 | 138 | < 0.1% |
| 3923 | 137 | < 0.1% |
| 8365 | 135 | < 0.1% |
| 4541 | 134 | < 0.1% |
| 2590 | 134 | < 0.1% |
| 467 | 133 | < 0.1% |
| 3676 | 133 | < 0.1% |
| Other values (9989) | 998627 |
| Value | Count | Frequency (%) |
| 2107 | 11 | < 0.1% |
| 9868 | 11 | < 0.1% |
| 9170 | 10 | < 0.1% |
| 3635 | 10 | < 0.1% |
| 9308 | 10 | < 0.1% |
| 4355 | 10 | < 0.1% |
| 8147 | 10 | < 0.1% |
| 4091 | 10 | < 0.1% |
| 8255 | 10 | < 0.1% |
| 9211 | 10 | < 0.1% |
| Other values (9497) | 29898 |
| Value | Count | Frequency (%) |
| 1 | 92 | |
| 2 | 107 | |
| 3 | 117 | |
| 4 | 97 | |
| 5 | 92 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 5 | |
| 3 | 2 | < 0.1% |
| 7 | 3 | |
| 8 | 4 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 5 | |
| 3 | 2 | < 0.1% |
| 7 | 3 | |
| 8 | 4 |
| Value | Count | Frequency (%) |
| 1 | 92 | |
| 2 | 107 | |
| 3 | 117 | |
| 4 | 97 | |
| 5 | 92 |
product_category
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 9 | 9 |
| Mean length | 8.196389 | 8.207766667 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Electronics | Electronics |
| 2nd row | Groceries | Furniture |
| 3rd row | Toys | Toys |
| 4th row | Toys | Toys |
| 5th row | Clothing | Groceries |
| Value | Count | Frequency (%) |
| toys | 200669 | |
| groceries | 200214 | |
| clothing | 199778 | |
| electronics | 199756 | |
| furniture | 199583 |
| Value | Count | Frequency (%) |
| electronics | 6039 | |
| furniture | 6011 | |
| groceries | 5997 | |
| clothing | 5980 | |
| toys | 5973 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30055 | |
| e | 24044 | |
| i | 24027 | |
| o | 23989 | |
| c | 18075 | 7.3% |
| t | 18030 | 7.3% |
| n | 18030 | 7.3% |
| s | 18009 | 7.3% |
| u | 12022 | 4.9% |
| l | 12019 | 4.9% |
| Other values (8) | 47933 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246233 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30055 | |
| e | 24044 | |
| i | 24027 | |
| o | 23989 | |
| c | 18075 | 7.3% |
| t | 18030 | 7.3% |
| n | 18030 | 7.3% |
| s | 18009 | 7.3% |
| u | 12022 | 4.9% |
| l | 12019 | 4.9% |
| Other values (8) | 47933 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246233 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30055 | |
| e | 24044 | |
| i | 24027 | |
| o | 23989 | |
| c | 18075 | 7.3% |
| t | 18030 | 7.3% |
| n | 18030 | 7.3% |
| s | 18009 | 7.3% |
| u | 12022 | 4.9% |
| l | 12019 | 4.9% |
| Other values (8) | 47933 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8196389 |
| Value | Count | Frequency (%) |
| (unknown) | 246233 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 999350 | |
| o | 800417 | |
| e | 799767 | |
| i | 799331 | |
| s | 600639 | 7.3% |
| c | 599726 | 7.3% |
| n | 599117 | 7.3% |
| t | 599117 | 7.3% |
| l | 399534 | 4.9% |
| u | 399166 | 4.9% |
| Other values (8) | 1600225 |
| Value | Count | Frequency (%) |
| r | 30055 | |
| e | 24044 | |
| i | 24027 | |
| o | 23989 | |
| c | 18075 | 7.3% |
| t | 18030 | 7.3% |
| n | 18030 | 7.3% |
| s | 18009 | 7.3% |
| u | 12022 | 4.9% |
| l | 12019 | 4.9% |
| Other values (8) | 47933 |
quantity
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 9 | 9 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.002649 | 5.000966667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 9 | 9 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 3 | 3 |
| median | 5 | 5 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 8 | 8 |
| Interquartile range (IQR) | 4 | 4 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2.583751276 | 2.586101699 |
| Coefficient of variation (CV) | 0.516476626 | 0.5171203631 |
| Kurtosis | -1.231080652 | -1.234092138 |
| Mean | 5.002649 | 5.000966667 |
| Median Absolute Deviation (MAD) | 2 | 2 |
| Skewness | -0.0003647460673 | 0.003236533329 |
| Sum | 5002649 | 150029 |
| Variance | 6.675770659 | 6.687921996 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 111914 | |
| 3 | 111422 | |
| 7 | 111274 | |
| 1 | 111150 | |
| 4 | 111104 | |
| 6 | 111098 | |
| 2 | 110782 | |
| 8 | 110747 | |
| 5 | 110509 |
| Value | Count | Frequency (%) |
| 7 | 3386 | |
| 4 | 3378 | |
| 9 | 3377 | |
| 2 | 3352 | |
| 1 | 3327 | |
| 3 | 3326 | |
| 5 | 3321 | |
| 8 | 3305 | |
| 6 | 3228 |
| Value | Count | Frequency (%) |
| 1 | 111150 | |
| 2 | 110782 | |
| 3 | 111422 | |
| 4 | 111104 | |
| 5 | 110509 |
| Value | Count | Frequency (%) |
| 1 | 3327 | |
| 2 | 3352 | |
| 3 | 3326 | |
| 4 | 3378 | |
| 5 | 3321 |
| Value | Count | Frequency (%) |
| 1 | 3327 | |
| 2 | 3352 | |
| 3 | 3326 | |
| 4 | 3378 | |
| 5 | 3321 |
| Value | Count | Frequency (%) |
| 1 | 111150 | |
| 2 | 110782 | |
| 3 | 111422 | |
| 4 | 111104 | |
| 5 | 110509 |
unit_price
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 99896 | 25968 |
| Distinct (%) | 10.0% | 86.6% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500.2613169 | 499.58365 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1.06 |
| Maximum | 1000 | 999.98 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1.06 |
| 5-th percentile | 50.72 | 49.436 |
| Q1 | 250.31 | 252.1225 |
| median | 500.41 | 499.915 |
| Q3 | 750.16 | 748.18 |
| 95-th percentile | 949.91 | 948.981 |
| Maximum | 1000 | 999.98 |
| Range | 999 | 998.92 |
| Interquartile range (IQR) | 499.85 | 496.0575 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288.4628596 | 287.9481681 |
| Coefficient of variation (CV) | 0.5766243559 | 0.5763762846 |
| Kurtosis | -1.20144233 | -1.194704901 |
| Mean | 500.2613169 | 499.58365 |
| Median Absolute Deviation (MAD) | 249.93 | 248.13 |
| Skewness | -1.097330655 × 10⁻⁵ | -0.002188554165 |
| Sum | 500261316.9 | 14987509.5 |
| Variance | 83210.82139 | 82914.14749 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 226.51 | 28 | < 0.1% |
| 450.02 | 26 | < 0.1% |
| 591.8 | 25 | < 0.1% |
| 921.47 | 25 | < 0.1% |
| 354.83 | 25 | < 0.1% |
| 49.69 | 25 | < 0.1% |
| 111.41 | 24 | < 0.1% |
| 954.1 | 24 | < 0.1% |
| 619.19 | 24 | < 0.1% |
| 845.21 | 24 | < 0.1% |
| Other values (99886) | 999750 |
| Value | Count | Frequency (%) |
| 40.62 | 4 | < 0.1% |
| 602.8 | 4 | < 0.1% |
| 350.29 | 4 | < 0.1% |
| 352.5 | 4 | < 0.1% |
| 38.58 | 4 | < 0.1% |
| 573.88 | 4 | < 0.1% |
| 965.77 | 4 | < 0.1% |
| 957.31 | 4 | < 0.1% |
| 13.93 | 4 | < 0.1% |
| 999.13 | 4 | < 0.1% |
| Other values (25958) | 29960 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 1.01 | 9 | |
| 1.02 | 11 | |
| 1.03 | 8 | |
| 1.04 | 17 |
| Value | Count | Frequency (%) |
| 1.06 | 2 | |
| 1.09 | 1 | |
| 1.1 | 1 | |
| 1.12 | 1 | |
| 1.2 | 1 |
| Value | Count | Frequency (%) |
| 1.06 | 2 | |
| 1.09 | 1 | |
| 1.1 | 1 | |
| 1.12 | 1 | |
| 1.2 | 1 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 1.01 | 9 | |
| 1.02 | 11 | |
| 1.03 | 8 | |
| 1.04 | 17 |
discount_applied
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.24991049 | 0.2486653333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 9967 | 317 |
| Zeros (%) | 1.0% | 1.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.02 |
| Q1 | 0.13 | 0.12 |
| median | 0.25 | 0.25 |
| Q3 | 0.37 | 0.37 |
| 95-th percentile | 0.47 | 0.47 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.24 | 0.25 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 0.1443279083 | 0.1442589767 |
| Coefficient of variation (CV) | 0.5775184079 | 0.5801330437 |
| Kurtosis | -1.19713108 | -1.193243353 |
| Mean | 0.24991049 | 0.2486653333 |
| Median Absolute Deviation (MAD) | 0.12 | 0.13 |
| Skewness | 0.0002640976336 | 0.01196552323 |
| Sum | 249910.49 | 7459.96 |
| Variance | 0.02083054512 | 0.02081065235 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.19 | 20302 | 2.0% |
| 0.06 | 20213 | 2.0% |
| 0.34 | 20211 | 2.0% |
| 0.03 | 20207 | 2.0% |
| 0.05 | 20207 | 2.0% |
| 0.21 | 20199 | 2.0% |
| 0.29 | 20155 | 2.0% |
| 0.07 | 20153 | 2.0% |
| 0.43 | 20145 | 2.0% |
| 0.18 | 20111 | 2.0% |
| Other values (41) | 798097 |
| Value | Count | Frequency (%) |
| 0.07 | 659 | 2.2% |
| 0.39 | 642 | 2.1% |
| 0.19 | 636 | 2.1% |
| 0.21 | 636 | 2.1% |
| 0.12 | 627 | 2.1% |
| 0.43 | 627 | 2.1% |
| 0.25 | 625 | 2.1% |
| 0.17 | 624 | 2.1% |
| 0.01 | 623 | 2.1% |
| 0.18 | 622 | 2.1% |
| Other values (41) | 23679 |
| Value | Count | Frequency (%) |
| 0 | 9967 | |
| 0.01 | 20018 | |
| 0.02 | 19788 | |
| 0.03 | 20207 | |
| 0.04 | 19947 |
| Value | Count | Frequency (%) |
| 0 | 317 | |
| 0.01 | 623 | |
| 0.02 | 594 | |
| 0.03 | 585 | |
| 0.04 | 604 |
| Value | Count | Frequency (%) |
| 0 | 317 | |
| 0.01 | 623 | |
| 0.02 | 594 | |
| 0.03 | 585 | |
| 0.04 | 604 |
| Value | Count | Frequency (%) |
| 0 | 9967 | |
| 0.01 | 20018 | |
| 0.02 | 19788 | |
| 0.03 | 20207 | |
| 0.04 | 19947 |
payment_method
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 14 | 14 |
| Median length | 11 | 11 |
| Mean length | 9.751935 | 9.763566667 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Credit Card | Credit Card |
| 2nd row | Credit Card | Mobile Payment |
| 3rd row | Debit Card | Cash |
| 4th row | Credit Card | Cash |
| 5th row | Mobile Payment | Cash |
| Value | Count | Frequency (%) |
| card | 500200 | |
| credit | 250435 | |
| mobile | 250030 | |
| payment | 250030 | |
| cash | 249770 | |
| debit | 249765 |
| Value | Count | Frequency (%) |
| card | 15048 | |
| debit | 7579 | |
| mobile | 7515 | |
| payment | 7515 | |
| credit | 7469 | |
| cash | 7437 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| (space) | 750230 | 7.7% |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30078 | |
| a | 30000 | |
| C | 29954 | |
| (space) | 22563 | 7.7% |
| i | 22563 | 7.7% |
| t | 22563 | 7.7% |
| r | 22517 | 7.7% |
| d | 22517 | 7.7% |
| b | 15094 | 5.2% |
| D | 7579 | 2.6% |
| Other values (9) | 67479 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 292907 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| (space) | 750230 | 7.7% |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30078 | |
| a | 30000 | |
| C | 29954 | |
| (space) | 22563 | 7.7% |
| i | 22563 | 7.7% |
| t | 22563 | 7.7% |
| r | 22517 | 7.7% |
| d | 22517 | 7.7% |
| b | 15094 | 5.2% |
| D | 7579 | 2.6% |
| Other values (9) | 67479 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 292907 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| (space) | 750230 | 7.7% |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30078 | |
| a | 30000 | |
| C | 29954 | |
| (space) | 22563 | 7.7% |
| i | 22563 | 7.7% |
| t | 22563 | 7.7% |
| r | 22517 | 7.7% |
| d | 22517 | 7.7% |
| b | 15094 | 5.2% |
| D | 7579 | 2.6% |
| Other values (9) | 67479 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9751935 |
| Value | Count | Frequency (%) |
| (unknown) | 292907 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 1000405 | |
| e | 1000260 | |
| a | 1000000 | |
| r | 750635 | 7.7% |
| d | 750635 | 7.7% |
| t | 750230 | 7.7% |
| i | 750230 | 7.7% |
| (space) | 750230 | 7.7% |
| b | 499795 | 5.1% |
| M | 250030 | 2.6% |
| Other values (9) | 2249485 |
| Value | Count | Frequency (%) |
| e | 30078 | |
| a | 30000 | |
| C | 29954 | |
| (space) | 22563 | 7.7% |
| i | 22563 | 7.7% |
| t | 22563 | 7.7% |
| r | 22517 | 7.7% |
| d | 22517 | 7.7% |
| b | 15094 | 5.2% |
| D | 7579 | 2.6% |
| Other values (9) | 67479 |
store_location
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 10 | 10 |
| Mean length | 10 | 10 |
| Min length | 10 | 10 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Location A | Location A |
| 2nd row | Location C | Location A |
| 3rd row | Location A | Location C |
| 4th row | Location A | Location A |
| 5th row | Location C | Location B |
| Value | Count | Frequency (%) |
| location | 1000000 | |
| c | 250336 | 12.5% |
| b | 250280 | 12.5% |
| a | 250150 | 12.5% |
| d | 249234 | 12.5% |
| Value | Count | Frequency (%) |
| location | 30000 | |
| a | 7612 | 12.7% |
| c | 7515 | 12.5% |
| b | 7462 | 12.4% |
| d | 7411 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7612 | 2.5% |
| C | 7515 | 2.5% |
| Other values (2) | 14873 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7612 | 2.5% |
| C | 7515 | 2.5% |
| Other values (2) | 14873 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7612 | 2.5% |
| C | 7515 | 2.5% |
| Other values (2) | 14873 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| C | 250336 | 2.5% |
| B | 250280 | 2.5% |
| Other values (2) | 499384 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7612 | 2.5% |
| C | 7515 | 2.5% |
| Other values (2) | 14873 | 5.0% |
transaction_hour
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 24 | 24 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 11.505193 | 11.48146667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 23 | 23 |
| Zeros | 41756 | 1312 |
| Zeros (%) | 4.2% | 4.4% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 1 | 1 |
| Q1 | 5 | 5 |
| median | 12 | 12 |
| Q3 | 18 | 17 |
| 95-th percentile | 22 | 22 |
| Maximum | 23 | 23 |
| Range | 23 | 23 |
| Interquartile range (IQR) | 13 | 12 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 6.924459761 | 6.931358066 |
| Coefficient of variation (CV) | 0.6018551589 | 0.6036997073 |
| Kurtosis | -1.205305317 | -1.205421909 |
| Mean | 11.505193 | 11.48146667 |
| Median Absolute Deviation (MAD) | 6 | 6 |
| Skewness | -0.001531297707 | -0.004865786085 |
| Sum | 11505193 | 344444 |
| Variance | 47.94814298 | 48.04372464 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 42166 | 4.2% |
| 14 | 42161 | 4.2% |
| 18 | 41872 | 4.2% |
| 20 | 41812 | 4.2% |
| 3 | 41780 | 4.2% |
| 21 | 41778 | 4.2% |
| 4 | 41756 | 4.2% |
| 0 | 41756 | 4.2% |
| 23 | 41750 | 4.2% |
| 19 | 41707 | 4.2% |
| Other values (14) | 581462 |
| Value | Count | Frequency (%) |
| 0 | 1312 | 4.4% |
| 16 | 1300 | 4.3% |
| 6 | 1285 | 4.3% |
| 17 | 1284 | 4.3% |
| 13 | 1281 | 4.3% |
| 3 | 1279 | 4.3% |
| 1 | 1267 | 4.2% |
| 22 | 1259 | 4.2% |
| 12 | 1259 | 4.2% |
| 20 | 1253 | 4.2% |
| Other values (14) | 17221 |
| Value | Count | Frequency (%) |
| 0 | 41756 | |
| 1 | 41637 | |
| 2 | 41388 | |
| 3 | 41780 | |
| 4 | 41756 |
| Value | Count | Frequency (%) |
| 0 | 1312 | |
| 1 | 1267 | |
| 2 | 1213 | |
| 3 | 1279 | |
| 4 | 1213 |
| Value | Count | Frequency (%) |
| 0 | 1312 | |
| 1 | 1267 | |
| 2 | 1213 | |
| 3 | 1279 | |
| 4 | 1213 |
| Value | Count | Frequency (%) |
| 0 | 41756 | |
| 1 | 41637 | |
| 2 | 41388 | |
| 3 | 41780 | |
| 4 | 41756 |
day_of_week
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 7 | 7 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 9 | 9 |
| Median length | 8 | 8 |
| Mean length | 7.141075 | 7.1391 |
| Min length | 6 | 6 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Wednesday | Wednesday |
| 2nd row | Friday | Friday |
| 3rd row | Saturday | Friday |
| 4th row | Friday | Friday |
| 5th row | Monday | Wednesday |
| Value | Count | Frequency (%) |
| tuesday | 143452 | |
| friday | 143067 | |
| thursday | 142930 | |
| sunday | 142875 | |
| monday | 142855 | |
| saturday | 142700 | |
| wednesday | 142121 |
| Value | Count | Frequency (%) |
| sunday | 4392 | |
| saturday | 4301 | |
| thursday | 4271 | |
| friday | 4269 | |
| wednesday | 4258 | |
| tuesday | 4255 | |
| monday | 4254 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| a | 34301 | |
| d | 34258 | |
| y | 30000 | |
| u | 17219 | |
| n | 12904 | 6.0% |
| r | 12841 | 6.0% |
| s | 12784 | 6.0% |
| e | 12771 | 6.0% |
| S | 8693 | 4.1% |
| T | 8526 | 4.0% |
| Other values (7) | 29876 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214173 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| a | 34301 | |
| d | 34258 | |
| y | 30000 | |
| u | 17219 | |
| n | 12904 | 6.0% |
| r | 12841 | 6.0% |
| s | 12784 | 6.0% |
| e | 12771 | 6.0% |
| S | 8693 | 4.1% |
| T | 8526 | 4.0% |
| Other values (7) | 29876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214173 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| a | 34301 | |
| d | 34258 | |
| y | 30000 | |
| u | 17219 | |
| n | 12904 | 6.0% |
| r | 12841 | 6.0% |
| s | 12784 | 6.0% |
| e | 12771 | 6.0% |
| S | 8693 | 4.1% |
| T | 8526 | 4.0% |
| Other values (7) | 29876 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7141075 |
| Value | Count | Frequency (%) |
| (unknown) | 214173 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1142700 | |
| d | 1142121 | |
| y | 1000000 | |
| u | 571957 | |
| r | 428697 | 6.0% |
| s | 428503 | 6.0% |
| n | 427851 | 6.0% |
| e | 427694 | 6.0% |
| T | 286382 | 4.0% |
| S | 285575 | 4.0% |
| Other values (7) | 999595 |
| Value | Count | Frequency (%) |
| a | 34301 | |
| d | 34258 | |
| y | 30000 | |
| u | 17219 | |
| n | 12904 | 6.0% |
| r | 12841 | 6.0% |
| s | 12784 | 6.0% |
| e | 12771 | 6.0% |
| S | 8693 | 4.1% |
| T | 8526 | 4.0% |
| Other values (7) | 29876 |
week_of_year
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 52 | 52 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 26.503691 | 26.4998 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 52 | 52 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 3 | 3 |
| Q1 | 14 | 14 |
| median | 27 | 27 |
| Q3 | 39 | 39.25 |
| 95-th percentile | 50 | 50 |
| Maximum | 52 | 52 |
| Range | 51 | 51 |
| Interquartile range (IQR) | 25 | 25.25 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 15.00516516 | 15.02027475 |
| Coefficient of variation (CV) | 0.566153792 | 0.5668070986 |
| Kurtosis | -1.199199248 | -1.198848872 |
| Mean | 26.503691 | 26.4998 |
| Median Absolute Deviation (MAD) | 13 | 13 |
| Skewness | -0.0005909978351 | -0.005785736997 |
| Sum | 26503691 | 794994 |
| Variance | 225.1549815 | 225.6086536 |
| Monotonicity | Not monotonic | Not monotonic |
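Several of the descriptive statistics above are derived from others in the same tables. As a quick consistency check, the sketch below recomputes the coefficient of variation, variance, IQR and range for the full dataset from the reported mean, standard deviation, quartiles and extremes. The variable names are illustrative only.

```python
# Full-dataset figures for week_of_year, copied from the tables above.
mean = 26.503691
std = 15.00516516
q1, q3 = 14, 39
minimum, maximum = 1, 52

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 6), round(variance, 4), iqr, value_range)
```

The recomputed values agree with the reported CV, variance, IQR and range to the precision shown in the report.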
| Value | Count | Frequency (%) |
| 27 | 19588 | 2.0% |
| 19 | 19507 | 2.0% |
| 51 | 19447 | 1.9% |
| 26 | 19425 | 1.9% |
| 1 | 19399 | 1.9% |
| 25 | 19386 | 1.9% |
| 21 | 19371 | 1.9% |
| 44 | 19356 | 1.9% |
| 16 | 19348 | 1.9% |
| 9 | 19340 | 1.9% |
| Other values (42) | 805833 |
| Value | Count | Frequency (%) |
| 19 | 624 | 2.1% |
| 2 | 622 | 2.1% |
| 46 | 611 | 2.0% |
| 48 | 611 | 2.0% |
| 44 | 605 | 2.0% |
| 33 | 603 | 2.0% |
| 8 | 603 | 2.0% |
| 28 | 601 | 2.0% |
| 24 | 601 | 2.0% |
| 10 | 598 | 2.0% |
| Other values (42) | 23921 |
| Value | Count | Frequency (%) |
| 1 | 19399 | |
| 2 | 19179 | |
| 3 | 19150 | |
| 4 | 19137 | |
| 5 | 19328 |
| Value | Count | Frequency (%) |
| 1 | 584 | |
| 2 | 622 | |
| 3 | 594 | |
| 4 | 581 | |
| 5 | 561 |
month_of_year
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 12 | 12 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 6.497467 | 6.524033333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 12 | 12 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1 | 1 |
| Q1 | 3 | 4 |
| median | 7 | 7 |
| Q3 | 10 | 10 |
| 95-th percentile | 12 | 12 |
| Maximum | 12 | 12 |
| Range | 11 | 11 |
| Interquartile range (IQR) | 7 | 6 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 3.455211936 | 3.457719035 |
| Coefficient of variation (CV) | 0.531778297 | 0.5299971442 |
| Kurtosis | -1.21903855 | -1.225186228 |
| Mean | 6.497467 | 6.524033333 |
| Median Absolute Deviation (MAD) | 3 | 3 |
| Skewness | 0.0003820436436 | -0.01077981305 |
| Sum | 6497467 | 195721 |
| Variance | 11.93848952 | 11.95582093 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 83951 | |
| 11 | 83645 | |
| 1 | 83624 | |
| 7 | 83475 | |
| 12 | 83353 | |
| 9 | 83328 | |
| 3 | 83244 | |
| 5 | 83135 | |
| 10 | 83113 | |
| 8 | 83093 | |
| Other values (2) | 166039 |
| Value | Count | Frequency (%) |
| 10 | 2589 | |
| 9 | 2562 | |
| 2 | 2553 | |
| 12 | 2535 | |
| 8 | 2496 | |
| 6 | 2492 | |
| 3 | 2478 | |
| 11 | 2475 | |
| 4 | 2470 | |
| 5 | 2466 | |
| Other values (2) | 4884 |
| Value | Count | Frequency (%) |
| 1 | 83624 | |
| 2 | 83951 | |
| 3 | 83244 | |
| 4 | 83091 | |
| 5 | 83135 |
| Value | Count | Frequency (%) |
| 1 | 2455 | |
| 2 | 2553 | |
| 3 | 2478 | |
| 4 | 2470 | |
| 5 | 2466 |
avg_purchase_value
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 49001 | 22551 |
| Distinct (%) | 4.9% | 75.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 254.8864443 | 256.1898827 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10 |
| Maximum | 500 | 499.96 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
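Distinct (%) is the distinct count divided by the number of observations, which is why the sample reports 75.2% against 4.9% for the full dataset even though it contains fewer distinct values. The sketch below reproduces both percentages. It also checks, under the assumption that values are recorded to two decimals between 10 and 500 (as the listed values suggest), that the full dataset covers every possible value.

```python
# Distinct counts and row counts for avg_purchase_value (from the tables above).
datasets = {
    "Full Dataset": {"distinct": 49001, "rows": 1_000_000},
    "Systematic Sample": {"distinct": 22551, "rows": 30_000},
}

distinct_pct = {
    name: 100 * d["distinct"] / d["rows"] for name, d in datasets.items()
}

# Assumption: values lie on a 0.01 grid from 10.00 to 500.00 inclusive.
max_possible_distinct = round((500 - 10) / 0.01) + 1

for name, pct in distinct_pct.items():
    print(name, f"{pct:.1f}%")
print("grid size:", max_possible_distinct)
```

Under that assumption the full dataset has saturated the value grid, so its distinct percentage can only fall as more rows are added, whereas the 30000-row sample is still far from saturation.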
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10 |
| 5-th percentile | 34.4 | 34.4295 |
| Q1 | 132.22 | 133.445 |
| median | 254.93 | 256.425 |
| Q3 | 377.35 | 377.93 |
| 95-th percentile | 475.56 | 476.8705 |
| Maximum | 500 | 499.96 |
| Range | 490 | 489.96 |
| Interquartile range (IQR) | 245.13 | 244.485 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 141.4949233 | 141.9802413 |
| Coefficient of variation (CV) | 0.5551292604 | 0.5541992518 |
| Kurtosis | -1.200170422 | -1.197255262 |
| Mean | 254.8864443 | 256.1898827 |
| Median Absolute Deviation (MAD) | 122.57 | 122.275 |
| Skewness | 0.0003762833586 | -0.008478684966 |
| Sum | 254886444.3 | 7685696.48 |
| Variance | 20020.81333 | 20158.38892 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 76.54 | 41 | < 0.1% |
| 482.75 | 41 | < 0.1% |
| 372.04 | 40 | < 0.1% |
| 397.45 | 39 | < 0.1% |
| 246.87 | 39 | < 0.1% |
| 60.53 | 38 | < 0.1% |
| 278.34 | 38 | < 0.1% |
| 315.26 | 38 | < 0.1% |
| 165.81 | 38 | < 0.1% |
| 492.47 | 38 | < 0.1% |
| Other values (48991) | 999610 |
| Value | Count | Frequency (%) |
| 236.57 | 6 | < 0.1% |
| 65.97 | 6 | < 0.1% |
| 291.76 | 6 | < 0.1% |
| 372.13 | 6 | < 0.1% |
| 335.61 | 5 | < 0.1% |
| 170.11 | 5 | < 0.1% |
| 58.11 | 5 | < 0.1% |
| 134.34 | 5 | < 0.1% |
| 230.67 | 5 | < 0.1% |
| 392.45 | 5 | < 0.1% |
| Other values (22541) | 29946 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 23 | |
| 10.02 | 29 | |
| 10.03 | 17 | |
| 10.04 | 21 |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 10.03 | 1 | |
| 10.04 | 1 | |
| 10.05 | 1 | |
| 10.07 | 2 |
purchase_frequency
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 6 | 6 |
| Mean length | 6.000399 | 5.993833333 |
| Min length | 5 | 5 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Weekly | Weekly |
| 2nd row | Daily | Yearly |
| 3rd row | Weekly | Yearly |
| 4th row | Weekly | Weekly |
| 5th row | Yearly | Monthly |
| Value | Count | Frequency (%) |
| yearly | 250767 | |
| monthly | 249932 | |
| weekly | 249768 | |
| daily | 249533 |
| Value | Count | Frequency (%) |
| daily | 7569 | |
| yearly | 7529 | |
| weekly | 7518 | |
| monthly | 7384 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22565 | |
| a | 15098 | |
| D | 7569 | 4.2% |
| i | 7569 | 4.2% |
| Y | 7529 | 4.2% |
| r | 7529 | 4.2% |
| W | 7518 | 4.2% |
| k | 7518 | 4.2% |
| Other values (5) | 36920 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179815 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22565 | |
| a | 15098 | |
| D | 7569 | 4.2% |
| i | 7569 | 4.2% |
| Y | 7529 | 4.2% |
| r | 7529 | 4.2% |
| W | 7518 | 4.2% |
| k | 7518 | 4.2% |
| Other values (5) | 36920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179815 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22565 | |
| a | 15098 | |
| D | 7569 | 4.2% |
| i | 7569 | 4.2% |
| Y | 7529 | 4.2% |
| r | 7529 | 4.2% |
| W | 7518 | 4.2% |
| k | 7518 | 4.2% |
| Other values (5) | 36920 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000399 |
| Value | Count | Frequency (%) |
| (unknown) | 179815 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 1000000 | |
| y | 1000000 | |
| e | 750303 | |
| a | 500300 | |
| Y | 250767 | 4.2% |
| r | 250767 | 4.2% |
| M | 249932 | 4.2% |
| o | 249932 | 4.2% |
| n | 249932 | 4.2% |
| t | 249932 | 4.2% |
| Other values (5) | 1248534 |
| Value | Count | Frequency (%) |
| l | 30000 | |
| y | 30000 | |
| e | 22565 | |
| a | 15098 | |
| D | 7569 | 4.2% |
| i | 7569 | 4.2% |
| Y | 7529 | 4.2% |
| r | 7529 | 4.2% |
| W | 7518 | 4.2% |
| k | 7518 | 4.2% |
| Other values (5) | 36920 |
last_purchase_date
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 984242 | 29978 |
| Distinct (%) | 98.4% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 968656 | 29956 |
| Unique (%) | 96.9% | 99.9% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | 2021-09-11 04:22:38 | 2021-09-11 04:22:38 |
| 2nd row | 2021-05-16 12:01:16 | 2021-07-06 13:17:09 |
| 3rd row | 2021-02-07 16:47:48 | 2021-12-20 08:00:19 |
| 4th row | 2021-12-30 23:48:26 | 2021-12-02 07:30:26 |
| 5th row | 2021-11-02 11:48:25 | 2021-10-27 01:27:24 |
| Value | Count | Frequency (%) |
| 2021-01-02 | 2870 | 0.1% |
| 2021-05-14 | 2866 | 0.1% |
| 2021-12-25 | 2860 | 0.1% |
| 2021-01-17 | 2860 | 0.1% |
| 2021-10-17 | 2856 | 0.1% |
| 2021-01-26 | 2856 | 0.1% |
| 2021-08-16 | 2854 | 0.1% |
| 2021-05-05 | 2852 | 0.1% |
| 2021-10-16 | 2850 | 0.1% |
| 2021-09-17 | 2849 | 0.1% |
| Other values (86754) | 1971427 |
| Value | Count | Frequency (%) |
| 2021-11-25 | 110 | 0.2% |
| 2021-05-04 | 107 | 0.2% |
| 2021-06-17 | 104 | 0.2% |
| 2021-03-13 | 104 | 0.2% |
| 2021-03-12 | 104 | 0.2% |
| 2021-08-02 | 104 | 0.2% |
| 2021-08-22 | 103 | 0.2% |
| 2021-03-01 | 103 | 0.2% |
| 2021-04-22 | 102 | 0.2% |
| 2021-09-18 | 101 | 0.2% |
| Other values (25779) | 58958 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102083 | |
| 0 | 99290 | |
| 1 | 88049 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26911 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23745 | 4.2% |
| 6 | 14062 | 2.5% |
| Other values (3) | 41846 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102083 | |
| 0 | 99290 | |
| 1 | 88049 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26911 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23745 | 4.2% |
| 6 | 14062 | 2.5% |
| Other values (3) | 41846 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102083 | |
| 0 | 99290 | |
| 1 | 88049 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26911 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23745 | 4.2% |
| 6 | 14062 | 2.5% |
| Other values (3) | 41846 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3411647 | |
| 0 | 3298747 | |
| 1 | 2941758 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 891719 | 4.7% |
| 5 | 800287 | 4.2% |
| 4 | 796296 | 4.2% |
| 7 | 466826 | 2.5% |
| Other values (3) | 1392720 |
| Value | Count | Frequency (%) |
| 2 | 102083 | |
| 0 | 99290 | |
| 1 | 88049 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26911 | 4.7% |
| 5 | 24014 | 4.2% |
| 4 | 23745 | 4.2% |
| 6 | 14062 | 2.5% |
| Other values (3) | 41846 |
avg_discount_used
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.25001009 | 0.2499986667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 10010 | 294 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.02 |
| Q1 | 0.13 | 0.13 |
| median | 0.25 | 0.25 |
| Q3 | 0.38 | 0.38 |
| 95-th percentile | 0.47 | 0.48 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.25 | 0.25 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 0.1443825628 | 0.1446056604 |
| Coefficient of variation (CV) | 0.5775069431 | 0.5784257266 |
| Kurtosis | -1.19810725 | -1.198616805 |
| Mean | 0.25001009 | 0.2499986667 |
| Median Absolute Deviation (MAD) | 0.12 | 0.13 |
| Skewness | 0.0002818589406 | -0.001666434333 |
| Sum | 250010.09 | 7499.96 |
| Variance | 0.02084632444 | 0.02091079702 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.39 | 20194 | 2.0% |
| 0.15 | 20188 | 2.0% |
| 0.08 | 20140 | 2.0% |
| 0.21 | 20138 | 2.0% |
| 0.34 | 20131 | 2.0% |
| 0.47 | 20125 | 2.0% |
| 0.05 | 20124 | 2.0% |
| 0.16 | 20123 | 2.0% |
| 0.46 | 20109 | 2.0% |
| 0.32 | 20093 | 2.0% |
| Other values (41) | 798635 |
| Value | Count | Frequency (%) |
| 0.26 | 639 | 2.1% |
| 0.13 | 637 | 2.1% |
| 0.08 | 636 | 2.1% |
| 0.46 | 636 | 2.1% |
| 0.28 | 632 | 2.1% |
| 0.31 | 628 | 2.1% |
| 0.14 | 626 | 2.1% |
| 0.03 | 625 | 2.1% |
| 0.05 | 621 | 2.1% |
| 0.39 | 620 | 2.1% |
| Other values (41) | 23700 |
| Value | Count | Frequency (%) |
| 0 | 10010 | |
| 0.01 | 19893 | |
| 0.02 | 19951 | |
| 0.03 | 19949 | |
| 0.04 | 20004 |
| Value | Count | Frequency (%) |
| 0 | 294 | |
| 0.01 | 617 | |
| 0.02 | 607 | |
| 0.03 | 625 | |
| 0.04 | 599 |
preferred_store
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 10 | 10 |
| Median length | 10 | 10 |
| Mean length | 10 | 10 |
| Min length | 10 | 10 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Location A | Location A |
| 2nd row | Location C | Location D |
| 3rd row | Location B | Location A |
| 4th row | Location B | Location B |
| 5th row | Location B | Location C |
| Value | Count | Frequency (%) |
| location | 1000000 | |
| b | 250262 | 12.5% |
| d | 250007 | 12.5% |
| a | 249949 | 12.5% |
| c | 249782 | 12.5% |
| Value | Count | Frequency (%) |
| location | 30000 | |
| a | 7580 | 12.6% |
| b | 7569 | 12.6% |
| c | 7467 | 12.4% |
| d | 7384 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7580 | 2.5% |
| B | 7569 | 2.5% |
| Other values (2) | 14851 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7580 | 2.5% |
| B | 7569 | 2.5% |
| Other values (2) | 14851 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7580 | 2.5% |
| B | 7569 | 2.5% |
| Other values (2) | 14851 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000000 |
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2000000 | |
| L | 1000000 | |
| c | 1000000 | |
| a | 1000000 | |
| t | 1000000 | |
| i | 1000000 | |
| n | 1000000 | |
| (space) | 1000000 | |
| B | 250262 | 2.5% |
| D | 250007 | 2.5% |
| Other values (2) | 499731 | 5.0% |
| Value | Count | Frequency (%) |
| o | 60000 | |
| L | 30000 | |
| c | 30000 | |
| a | 30000 | |
| t | 30000 | |
| i | 30000 | |
| n | 30000 | |
| (space) | 30000 | |
| A | 7580 | 2.5% |
| B | 7569 | 2.5% |
| Other values (2) | 14851 | 5.0% |
online_purchases
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.446018 | 49.18843333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 9997 | 298 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 4 | 4 |
| Q1 | 24 | 24 |
| median | 49 | 49 |
| Q3 | 74 | 74 |
| 95-th percentile | 94 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 28.86143913 | 28.91133668 |
| Coefficient of variation (CV) | 0.5836959234 | 0.5877669752 |
| Kurtosis | -1.200754846 | -1.207525653 |
| Mean | 49.446018 | 49.18843333 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.00143421854 | 0.01496678687 |
| Sum | 49446018 | 1475653 |
| Variance | 832.9826689 | 835.8653884 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 10324 | 1.0% |
| 28 | 10269 | 1.0% |
| 40 | 10198 | 1.0% |
| 67 | 10151 | 1.0% |
| 76 | 10150 | 1.0% |
| 61 | 10150 | 1.0% |
| 52 | 10140 | 1.0% |
| 88 | 10134 | 1.0% |
| 43 | 10133 | 1.0% |
| 45 | 10132 | 1.0% |
| Other values (90) | 898219 |
| Value | Count | Frequency (%) |
| 32 | 339 | 1.1% |
| 76 | 336 | 1.1% |
| 27 | 335 | 1.1% |
| 69 | 333 | 1.1% |
| 49 | 332 | 1.1% |
| 86 | 332 | 1.1% |
| 68 | 329 | 1.1% |
| 45 | 328 | 1.1% |
| 17 | 327 | 1.1% |
| 18 | 327 | 1.1% |
| Other values (90) | 26682 |
| Value | Count | Frequency (%) |
| 0 | 9997 | |
| 1 | 10023 | |
| 2 | 9792 | |
| 3 | 10091 | |
| 4 | 10324 |
| Value | Count | Frequency (%) |
| 0 | 298 | |
| 1 | 292 | |
| 2 | 321 | |
| 3 | 276 | |
| 4 | 320 |
in_store_purchases
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.484486 | 49.34656667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10016 | 317 |
| Zeros (%) | 1.0% | 1.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 5 | 4 |
| Q1 | 24 | 24 |
| median | 49 | 49 |
| Q3 | 75 | 74 |
| 95-th percentile | 95 | 95 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 51 | 50 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 28.88271174 | 28.89541411 |
| Coefficient of variation (CV) | 0.5836720572 | 0.5855607808 |
| Kurtosis | -1.20140369 | -1.202546777 |
| Mean | 49.484486 | 49.34656667 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.00159043567 | 0.009032009929 |
| Sum | 49484486 | 1480397 |
| Variance | 834.2110375 | 834.9449564 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 38 | 10264 | 1.0% |
| 30 | 10186 | 1.0% |
| 86 | 10183 | 1.0% |
| 10 | 10180 | 1.0% |
| 14 | 10171 | 1.0% |
| 7 | 10166 | 1.0% |
| 13 | 10164 | 1.0% |
| 50 | 10151 | 1.0% |
| 67 | 10141 | 1.0% |
| 91 | 10131 | 1.0% |
| Other values (90) | 898263 |
| Value | Count | Frequency (%) |
| 50 | 334 | 1.1% |
| 79 | 333 | 1.1% |
| 51 | 332 | 1.1% |
| 76 | 330 | 1.1% |
| 38 | 329 | 1.1% |
| 10 | 327 | 1.1% |
| 18 | 326 | 1.1% |
| 62 | 325 | 1.1% |
| 4 | 320 | 1.1% |
| 36 | 320 | 1.1% |
| Other values (90) | 26724 |
| Value | Count | Frequency (%) |
| 0 | 10016 | |
| 1 | 9978 | |
| 2 | 9953 | |
| 3 | 9965 | |
| 4 | 9926 |
| Value | Count | Frequency (%) |
| 0 | 317 | |
| 1 | 281 | |
| 2 | 301 | |
| 3 | 297 | |
| 4 | 320 |
avg_items_per_transaction
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 901 | 901 |
| Distinct (%) | 0.1% | 3.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.50312187 | 5.518995667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.45 | 1.44 |
| Q1 | 3.26 | 3.28 |
| median | 5.5 | 5.54 |
| Q3 | 7.75 | 7.76 |
| 95-th percentile | 9.55 | 9.55 |
| Maximum | 10 | 10 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 4.49 | 4.48 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2.597661275 | 2.597740489 |
| Coefficient of variation (CV) | 0.4720341173 | 0.4706908007 |
| Kurtosis | -1.199082145 | -1.197458824 |
| Mean | 5.50312187 | 5.518995667 |
| Median Absolute Deviation (MAD) | 2.25 | 2.24 |
| Skewness | -4.461054903 × 10⁻⁵ | -0.01353169941 |
| Sum | 5503121.87 | 165569.87 |
| Variance | 6.747844097 | 6.74825565 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 3.49 | 1205 | 0.1% |
| 5 | 1198 | 0.1% |
| 3.94 | 1197 | 0.1% |
| 6.41 | 1196 | 0.1% |
| 2.82 | 1193 | 0.1% |
| 8.41 | 1192 | 0.1% |
| 9.69 | 1192 | 0.1% |
| 4.29 | 1190 | 0.1% |
| 4.35 | 1188 | 0.1% |
| 6.14 | 1188 | 0.1% |
| Other values (891) | 988061 |
| Value | Count | Frequency (%) |
| 2.36 | 52 | 0.2% |
| 6.74 | 51 | 0.2% |
| 7.48 | 51 | 0.2% |
| 9 | 51 | 0.2% |
| 4.7 | 49 | 0.2% |
| 8.71 | 49 | 0.2% |
| 8.74 | 49 | 0.2% |
| 9.02 | 48 | 0.2% |
| 5.67 | 48 | 0.2% |
| 6.3 | 48 | 0.2% |
| Other values (891) | 29504 |
| Value | Count | Frequency (%) |
| 1 | 514 | |
| 1.01 | 1135 | |
| 1.02 | 1105 | |
| 1.03 | 1122 | |
| 1.04 | 1067 |
| Value | Count | Frequency (%) |
| 1 | 10 | < 0.1% |
| 1.01 | 40 | |
| 1.02 | 32 | |
| 1.03 | 23 | |
| 1.04 | 25 |
avg_transaction_value
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 49001 | 22388 |
| Distinct (%) | 4.9% | 74.6% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 255.1157678 | 255.481149 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| Maximum | 500 | 500 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| 5-th percentile | 34.52 | 33.979 |
| Q1 | 132.51 | 132.42 |
| median | 255.23 | 255.94 |
| Q3 | 377.67 | 378.7525 |
| 95-th percentile | 475.36 | 475.78 |
| Maximum | 500 | 500 |
| Range | 490 | 489.98 |
| Interquartile range (IQR) | 245.16 | 246.3325 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 141.4300141 | 141.8474272 |
| Coefficient of variation (CV) | 0.5543758243 | 0.5552168046 |
| Kurtosis | -1.200885422 | -1.20331159 |
| Mean | 255.1157678 | 255.481149 |
| Median Absolute Deviation (MAD) | 122.58 | 123.175 |
| Skewness | -0.001148163222 | -0.001707380871 |
| Sum | 255115767.8 | 7664434.47 |
| Variance | 20002.44888 | 20120.6926 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 362.11 | 43 | < 0.1% |
| 157.68 | 43 | < 0.1% |
| 86.72 | 42 | < 0.1% |
| 303.99 | 41 | < 0.1% |
| 193.66 | 41 | < 0.1% |
| 112.64 | 40 | < 0.1% |
| 342.26 | 40 | < 0.1% |
| 454.72 | 39 | < 0.1% |
| 64.18 | 39 | < 0.1% |
| 280.55 | 39 | < 0.1% |
| Other values (48991) | 999593 |
| Value | Count | Frequency (%) |
| 487.28 | 5 | < 0.1% |
| 489.95 | 5 | < 0.1% |
| 269.06 | 5 | < 0.1% |
| 47.07 | 5 | < 0.1% |
| 304.22 | 5 | < 0.1% |
| 415.71 | 5 | < 0.1% |
| 231.25 | 5 | < 0.1% |
| 242.37 | 5 | < 0.1% |
| 229.29 | 5 | < 0.1% |
| 158.25 | 5 | < 0.1% |
| Other values (22378) | 29950 |
| Value | Count | Frequency (%) |
| 10 | 8 | < 0.1% |
| 10.01 | 18 | |
| 10.02 | 17 | |
| 10.03 | 28 | |
| 10.04 | 24 |
| Value | Count | Frequency (%) |
| 10.02 | 1 | < 0.1% |
| 10.03 | 1 | < 0.1% |
| 10.05 | 1 | < 0.1% |
| 10.07 | 1 | < 0.1% |
| 10.08 | 3 |
total_returned_items
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 10 | 10 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 4.498142 | 4.504633333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 9 | 9 |
| Zeros | 100060 | 3043 |
| Zeros (%) | 10.0% | 10.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 2 | 2 |
| median | 4 | 5 |
| Q3 | 7 | 7 |
| 95-th percentile | 9 | 9 |
| Maximum | 9 | 9 |
| Range | 9 | 9 |
| Interquartile range (IQR) | 5 | 5 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2.872805041 | 2.878191793 |
| Coefficient of variation (CV) | 0.6386648177 | 0.6389403044 |
| Kurtosis | -1.225109848 | -1.225473688 |
| Mean | 4.498142 | 4.504633333 |
| Median Absolute Deviation (MAD) | 3 | 3 |
| Skewness | 0.0007692254728 | -0.001610088949 |
| Sum | 4498142 | 135139 |
| Variance | 8.253008801 | 8.283987998 |
| Monotonicity | Not monotonic | Not monotonic |
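The derived rows in this table can be cross-checked from the reported mean and standard deviation. The sketch below is an illustration only, not part of the profiling output. It assumes the coefficient of variation is the standard deviation divided by the mean and that the variance is the square of the standard deviation. It uses only the Full Dataset figures for total_returned_items shown above, and the variable names are hypothetical.

```python
import math

# Full Dataset figures for total_returned_items, copied from the table above.
mean = 4.498142
std = 2.872805041
reported_cv = 0.6386648177
reported_variance = 8.253008801

# Assumed definitions: CV = std / mean, variance = std ** 2.
cv = std / mean
variance = std ** 2

# Compare with the reported values, allowing for rounding in the printed digits.
cv_matches = math.isclose(cv, reported_cv, rel_tol=1e-6)
variance_matches = math.isclose(variance, reported_variance, rel_tol=1e-6)

print(f"CV = {cv:.10f}, matches report: {cv_matches}")
print(f"variance = {variance:.9f}, matches report: {variance_matches}")
```

The same check applies to any other numeric variable in the report by substituting its mean and standard deviation.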
| Value | Count | Frequency (%) |
| 1 | 100298 | |
| 7 | 100190 | |
| 3 | 100119 | |
| 0 | 100060 | |
| 6 | 100004 | |
| 2 | 99991 | |
| 9 | 99942 | |
| 8 | 99838 | |
| 4 | 99821 | |
| 5 | 99737 |
| Value | Count | Frequency (%) |
| 9 | 3045 | |
| 0 | 3043 | |
| 3 | 3021 | |
| 6 | 3016 | |
| 8 | 3006 | |
| 2 | 3003 | |
| 5 | 2991 | |
| 4 | 2969 | |
| 7 | 2964 | |
| 1 | 2942 |
| Value | Count | Frequency (%) |
| 0 | 100060 | |
| 1 | 100298 | |
| 2 | 99991 | |
| 3 | 100119 | |
| 4 | 99821 |
| Value | Count | Frequency (%) |
| 0 | 3043 | |
| 1 | 2942 | |
| 2 | 3003 | |
| 3 | 3021 | |
| 4 | 2969 |
total_returned_value
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 99999 | 26021 |
| Distinct (%) | 10.0% | 86.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 500.3878374 | 501.730237 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| Maximum | 1000 | 999.99 |
| Zeros | 4 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| 5-th percentile | 50 | 52.979 |
| Q1 | 250.63 | 253.8725 |
| median | 500.4 | 501.7 |
| Q3 | 750.39 | 748.98 |
| 95-th percentile | 950.22 | 949.2215 |
| Maximum | 1000 | 999.99 |
| Range | 1000 | 999.98 |
| Interquartile range (IQR) | 499.76 | 495.1075 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288.7174763 | 287.2884784 |
| Coefficient of variation (CV) | 0.5769873981 | 0.5725955049 |
| Kurtosis | -1.199754459 | -1.188818796 |
| Mean | 500.3878374 | 501.730237 |
| Median Absolute Deviation (MAD) | 249.89 | 247.65 |
| Skewness | -0.001264828821 | -0.002747522577 |
| Sum | 500387837.4 | 15051907.11 |
| Variance | 83357.78115 | 82534.66982 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 160.66 | 28 | < 0.1% |
| 467.66 | 26 | < 0.1% |
| 188.3 | 26 | < 0.1% |
| 488.88 | 25 | < 0.1% |
| 544.94 | 25 | < 0.1% |
| 651.87 | 25 | < 0.1% |
| 981.42 | 25 | < 0.1% |
| 330.91 | 25 | < 0.1% |
| 676.05 | 25 | < 0.1% |
| 227.59 | 25 | < 0.1% |
| Other values (99989) | 999745 |
| Value | Count | Frequency (%) |
| 514.02 | 5 | < 0.1% |
| 714.49 | 5 | < 0.1% |
| 928.66 | 4 | < 0.1% |
| 973.06 | 4 | < 0.1% |
| 139.72 | 4 | < 0.1% |
| 937.33 | 4 | < 0.1% |
| 198.43 | 4 | < 0.1% |
| 577.78 | 4 | < 0.1% |
| 866.57 | 4 | < 0.1% |
| 281.52 | 4 | < 0.1% |
| Other values (26011) | 29958 |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 0.01 | 13 | |
| 0.02 | 12 | |
| 0.03 | 11 | |
| 0.04 | 7 |
| Value | Count | Frequency (%) |
| 0.01 | 1 | |
| 0.02 | 1 | |
| 0.04 | 1 | |
| 0.11 | 1 | |
| 0.17 | 1 |
total_sales
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 629254 | 29563 |
| Distinct (%) | 62.9% | 98.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5056.059765 | 5070.517322 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 100.01 | 100.04 |
| Maximum | 9999.98 | 9998.05 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 100.01 | 100.04 |
| 5-th percentile | 595.7095 | 581.918 |
| Q1 | 2577.8675 | 2552.5375 |
| median | 5059.695 | 5102.03 |
| Q3 | 7534.8025 | 7586.2775 |
| 95-th percentile | 9507.96 | 9522.561 |
| Maximum | 9999.98 | 9998.05 |
| Range | 9899.97 | 9898.01 |
| Interquartile range (IQR) | 4956.935 | 5033.74 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2859.100058 | 2876.151961 |
| Coefficient of variation (CV) | 0.5654798777 | 0.5672304774 |
| Kurtosis | -1.201132214 | -1.219650902 |
| Mean | 5056.059765 | 5070.517322 |
| Median Absolute Deviation (MAD) | 2478.365 | 2517.305 |
| Skewness | -0.002792355347 | -0.009818384845 |
| Sum | 5056059765 | 152115519.7 |
| Variance | 8174453.14 | 8272250.103 |
| Monotonicity | Not monotonic | Not monotonic |
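One way to judge whether the Systematic Sample mean is consistent with the Full Dataset mean is to compare their difference with the standard error of the sample mean. The sketch below assumes the simple-random-sampling formula std / sqrt(n) is an acceptable approximation for a systematic sample, and it ignores the finite-population correction. Both are simplifying assumptions rather than anything the report states. The sample size of 30000 comes from the report overview.

```python
import math

# Figures for total_sales, copied from the tables above.
full_mean = 5056.059765
sample_mean = 5070.517322
sample_std = 2876.151961
n_sample = 30_000  # number of observations in the Systematic Sample

# Approximate standard error of the sample mean (assumption: SRS formula).
standard_error = sample_std / math.sqrt(n_sample)

# Difference between the two means, expressed in standard errors.
z = (sample_mean - full_mean) / standard_error

print(f"standard error = {standard_error:.2f}")
print(f"difference = {sample_mean - full_mean:.2f} ({z:.2f} standard errors)")
```

Under these assumptions, a difference of less than about two standard errors would usually be read as ordinary sampling variation.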
| Value | Count | Frequency (%) |
| 9263.29 | 8 | < 0.1% |
| 1070.51 | 8 | < 0.1% |
| 8973.11 | 8 | < 0.1% |
| 7882.97 | 8 | < 0.1% |
| 630.03 | 8 | < 0.1% |
| 8669.59 | 8 | < 0.1% |
| 8191.02 | 8 | < 0.1% |
| 2558.91 | 8 | < 0.1% |
| 5572.95 | 8 | < 0.1% |
| 8266.95 | 8 | < 0.1% |
| Other values (629244) | 999920 |
| Value | Count | Frequency (%) |
| 5573.97 | 3 | < 0.1% |
| 6929.76 | 2 | < 0.1% |
| 4310.88 | 2 | < 0.1% |
| 1790.99 | 2 | < 0.1% |
| 4228.4 | 2 | < 0.1% |
| 8710.53 | 2 | < 0.1% |
| 7292.92 | 2 | < 0.1% |
| 3642.88 | 2 | < 0.1% |
| 5923.6 | 2 | < 0.1% |
| 2068.75 | 2 | < 0.1% |
| Other values (29553) | 29979 |
| Value | Count | Frequency (%) |
| 100.01 | 2 | |
| 100.02 | 2 | |
| 100.04 | 2 | |
| 100.05 | 2 | |
| 100.06 | 3 |
| Value | Count | Frequency (%) |
| 100.04 | 1 | |
| 100.06 | 2 | |
| 100.32 | 1 | |
| 100.36 | 1 | |
| 100.42 | 1 |
total_transactions
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 99 | 99 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.987386 | 49.87116667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 99 | 99 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 5 | 5 |
| Q1 | 25 | 25 |
| median | 50 | 50 |
| Q3 | 75 | 75 |
| 95-th percentile | 95 | 94 |
| Maximum | 99 | 99 |
| Range | 98 | 98 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 28.57168895 | 28.58786358 |
| Coefficient of variation (CV) | 0.5715779766 | 0.5732343054 |
| Kurtosis | -1.200697232 | -1.197065103 |
| Mean | 49.987386 | 49.87116667 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 6.496950968 × 10⁻⁵ | -0.000849598413 |
| Sum | 49987386 | 1496135 |
| Variance | 816.3414092 | 817.2659442 |
| Monotonicity | Not monotonic | Not monotonic |
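Kurtosis close to -1.2 together with near-zero skewness is what a uniform distribution would produce. As a rough check, assuming total_transactions is drawn uniformly from the integers 1 to 99 (an assumption suggested by the statistics, not stated in the report), the textbook moments of a discrete uniform distribution can be compared with the Full Dataset column:

```python
# Textbook moments of a discrete uniform distribution on the integers a..b.
a, b = 1, 99      # reported Minimum and Maximum of total_transactions
n = b - a + 1     # number of distinct values (the report lists 99)

expected_mean = (a + b) / 2
expected_variance = (n**2 - 1) / 12
expected_excess_kurtosis = -6 * (n**2 + 1) / (5 * (n**2 - 1))

# Full Dataset figures copied from the table above.
reported = {"mean": 49.987386, "variance": 816.3414092, "kurtosis": -1.200697232}

print(f"mean:     expected {expected_mean:.3f}, reported {reported['mean']:.3f}")
print(f"variance: expected {expected_variance:.3f}, reported {reported['variance']:.3f}")
print(f"kurtosis: expected {expected_excess_kurtosis:.4f}, reported {reported['kurtosis']:.4f}")
```

The comparison treats the reported kurtosis as excess kurtosis, which is an assumption consistent with the values near -1.2 seen throughout the report.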
| Value | Count | Frequency (%) |
| 93 | 10385 | 1.0% |
| 24 | 10328 | 1.0% |
| 61 | 10316 | 1.0% |
| 70 | 10306 | 1.0% |
| 49 | 10290 | 1.0% |
| 14 | 10280 | 1.0% |
| 83 | 10278 | 1.0% |
| 27 | 10251 | 1.0% |
| 16 | 10247 | 1.0% |
| 75 | 10245 | 1.0% |
| Other values (89) | 897074 |
| Value | Count | Frequency (%) |
| 57 | 342 | 1.1% |
| 86 | 339 | 1.1% |
| 94 | 335 | 1.1% |
| 35 | 335 | 1.1% |
| 5 | 333 | 1.1% |
| 4 | 331 | 1.1% |
| 25 | 329 | 1.1% |
| 50 | 329 | 1.1% |
| 28 | 327 | 1.1% |
| 88 | 326 | 1.1% |
| Other values (89) | 26674 |
| Value | Count | Frequency (%) |
| 1 | 10053 | |
| 2 | 10174 | |
| 3 | 10113 | |
| 4 | 10133 | |
| 5 | 10140 |
| Value | Count | Frequency (%) |
| 1 | 323 | |
| 2 | 298 | |
| 3 | 308 | |
| 4 | 331 | |
| 5 | 333 |
total_items_purchased
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 499 | 499 |
| Distinct (%) | < 0.1% | 1.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 250.042763 | 249.4003667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 499 | 499 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 26 | 25 |
| Q1 | 125 | 124 |
| median | 250 | 251 |
| Q3 | 375 | 373 |
| 95-th percentile | 475 | 474 |
| Maximum | 499 | 499 |
| Range | 498 | 498 |
| Interquartile range (IQR) | 250 | 249 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 143.9845462 | 143.5749326 |
| Coefficient of variation (CV) | 0.5758396862 | 0.5756805192 |
| Kurtosis | -1.199364571 | -1.195088792 |
| Mean | 250.042763 | 249.4003667 |
| Median Absolute Deviation (MAD) | 125 | 124 |
| Skewness | -0.0005289537985 | -0.003945628736 |
| Sum | 250042763 | 7482011 |
| Variance | 20731.54954 | 20613.76127 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 282 | 2156 | 0.2% |
| 285 | 2146 | 0.2% |
| 355 | 2132 | 0.2% |
| 459 | 2099 | 0.2% |
| 296 | 2098 | 0.2% |
| 241 | 2096 | 0.2% |
| 413 | 2090 | 0.2% |
| 331 | 2088 | 0.2% |
| 425 | 2087 | 0.2% |
| 260 | 2086 | 0.2% |
| Other values (489) | 978922 |
| Value | Count | Frequency (%) |
| 432 | 84 | 0.3% |
| 19 | 82 | 0.3% |
| 27 | 81 | 0.3% |
| 109 | 81 | 0.3% |
| 446 | 78 | 0.3% |
| 425 | 78 | 0.3% |
| 117 | 78 | 0.3% |
| 327 | 78 | 0.3% |
| 312 | 78 | 0.3% |
| 301 | 78 | 0.3% |
| Other values (489) | 29204 |
| Value | Count | Frequency (%) |
| 1 | 2005 | |
| 2 | 2077 | |
| 3 | 1999 | |
| 4 | 2019 | |
| 5 | 1988 |
| Value | Count | Frequency (%) |
| 1 | 69 | |
| 2 | 53 | |
| 3 | 50 | |
| 4 | 61 | |
| 5 | 63 |
total_discounts_received
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 99995 | 25886 |
| Distinct (%) | 10.0% | 86.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.6743882 | 498.7390803 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| Maximum | 1000 | 999.98 |
| Zeros | 6 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0.01 |
| 5-th percentile | 50.16 | 50.268 |
| Q1 | 249.76 | 251.03 |
| median | 499.51 | 496.145 |
| Q3 | 749.54 | 749.735 |
| 95-th percentile | 949.66 | 947.691 |
| Maximum | 1000 | 999.98 |
| Range | 1000 | 999.97 |
| Interquartile range (IQR) | 499.78 | 498.705 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288.5791016 | 288.1539788 |
| Coefficient of variation (CV) | 0.5775343071 | 0.5777649882 |
| Kurtosis | -1.200167414 | -1.2000378 |
| Mean | 499.6743882 | 498.7390803 |
| Median Absolute Deviation (MAD) | 249.9 | 249.435 |
| Skewness | 0.0009745010535 | 0.004135538111 |
| Sum | 499674388.2 | 14962172.41 |
| Variance | 83277.89786 | 83032.71553 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 52.87 | 26 | < 0.1% |
| 811.21 | 26 | < 0.1% |
| 721.58 | 26 | < 0.1% |
| 418.88 | 25 | < 0.1% |
| 760.97 | 25 | < 0.1% |
| 406.5 | 24 | < 0.1% |
| 784.58 | 24 | < 0.1% |
| 595.87 | 24 | < 0.1% |
| 918 | 24 | < 0.1% |
| 34.86 | 24 | < 0.1% |
| Other values (99985) | 999752 |
| Value | Count | Frequency (%) |
| 182.63 | 5 | < 0.1% |
| 61.51 | 5 | < 0.1% |
| 928.19 | 4 | < 0.1% |
| 394.34 | 4 | < 0.1% |
| 472.48 | 4 | < 0.1% |
| 427.73 | 4 | < 0.1% |
| 10.12 | 4 | < 0.1% |
| 705.39 | 4 | < 0.1% |
| 268.2 | 4 | < 0.1% |
| 98.15 | 4 | < 0.1% |
| Other values (25876) | 29958 |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 0.01 | 13 | |
| 0.02 | 8 | |
| 0.03 | 8 | |
| 0.04 | 6 |
| Value | Count | Frequency (%) |
| 0.01 | 1 | |
| 0.03 | 1 | |
| 0.05 | 1 | |
| 0.12 | 2 | |
| 0.13 | 1 |
avg_spent_per_category
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 98999 | 25874 |
| Distinct (%) | 9.9% | 86.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 505.1754779 | 507.2349573 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| Maximum | 1000 | 999.92 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| 5-th percentile | 59.49 | 59.169 |
| Q1 | 257.24 | 259.7075 |
| median | 505.14 | 510.545 |
| Q3 | 753.06 | 755.1775 |
| 95-th percentile | 950.7405 | 949.5105 |
| Maximum | 1000 | 999.92 |
| Range | 990 | 989.9 |
| Interquartile range (IQR) | 495.82 | 495.47 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 286.0591784 | 286.5485484 |
| Coefficient of variation (CV) | 0.566257055 | 0.5649227134 |
| Kurtosis | -1.201963641 | -1.203596811 |
| Mean | 505.1754779 | 507.2349573 |
| Median Absolute Deviation (MAD) | 247.91 | 247.71 |
| Skewness | -0.0002454959133 | -0.02254511471 |
| Sum | 505175477.9 | 15217048.72 |
| Variance | 81829.85355 | 82110.07062 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 202.69 | 27 | < 0.1% |
| 969.16 | 26 | < 0.1% |
| 582.24 | 26 | < 0.1% |
| 806.1 | 25 | < 0.1% |
| 330.74 | 25 | < 0.1% |
| 798.54 | 25 | < 0.1% |
| 299.29 | 25 | < 0.1% |
| 312.38 | 25 | < 0.1% |
| 825.53 | 24 | < 0.1% |
| 525.28 | 24 | < 0.1% |
| Other values (98989) | 999748 |
| Value | Count | Frequency (%) |
| 141.03 | 5 | < 0.1% |
| 36.15 | 4 | < 0.1% |
| 548.8 | 4 | < 0.1% |
| 710.93 | 4 | < 0.1% |
| 319.46 | 4 | < 0.1% |
| 333.34 | 4 | < 0.1% |
| 574.26 | 4 | < 0.1% |
| 665.71 | 4 | < 0.1% |
| 992.05 | 4 | < 0.1% |
| 514.73 | 4 | < 0.1% |
| Other values (25864) | 29959 |
| Value | Count | Frequency (%) |
| 10 | 4 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 13 | |
| 10.03 | 10 | |
| 10.04 | 13 |
| Value | Count | Frequency (%) |
| 10.02 | 1 | |
| 10.03 | 1 | |
| 10.05 | 2 | |
| 10.06 | 2 | |
| 10.1 | 1 |
max_single_purchase_value
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 99001 | 25794 |
| Distinct (%) | 9.9% | 86.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 505.0014045 | 504.4945963 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| Maximum | 1000 | 999.99 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 10 | 10.02 |
| 5-th percentile | 59.3 | 58.23 |
| Q1 | 256.84 | 255.115 |
| median | 505.22 | 506.13 |
| Q3 | 753.21 | 752.72 |
| 95-th percentile | 950.55 | 951.5725 |
| Maximum | 1000 | 999.99 |
| Range | 990 | 989.97 |
| Interquartile range (IQR) | 496.37 | 497.605 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 286.0733241 | 286.2036938 |
| Coefficient of variation (CV) | 0.5664802545 | 0.5673077489 |
| Kurtosis | -1.202495075 | -1.198648107 |
| Mean | 505.0014045 | 504.4945963 |
| Median Absolute Deviation (MAD) | 248.18 | 248.68 |
| Skewness | -0.0008466890807 | -0.0007518293936 |
| Sum | 505001404.5 | 15134837.89 |
| Variance | 81837.94677 | 81912.55434 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 575.57 | 28 | < 0.1% |
| 461.6 | 26 | < 0.1% |
| 874.29 | 25 | < 0.1% |
| 105.78 | 25 | < 0.1% |
| 354.85 | 25 | < 0.1% |
| 736.87 | 25 | < 0.1% |
| 439.72 | 25 | < 0.1% |
| 893.75 | 24 | < 0.1% |
| 179.32 | 24 | < 0.1% |
| 330.94 | 24 | < 0.1% |
| Other values (98991) | 999749 |
| Value | Count | Frequency (%) |
| 157.57 | 5 | < 0.1% |
| 985.66 | 4 | < 0.1% |
| 661.26 | 4 | < 0.1% |
| 621.57 | 4 | < 0.1% |
| 970.54 | 4 | < 0.1% |
| 139.51 | 4 | < 0.1% |
| 477.98 | 4 | < 0.1% |
| 701.49 | 4 | < 0.1% |
| 452.59 | 4 | < 0.1% |
| 846.64 | 4 | < 0.1% |
| Other values (25784) | 29959 |
| Value | Count | Frequency (%) |
| 10 | 6 | < 0.1% |
| 10.01 | 8 | |
| 10.02 | 15 | |
| 10.03 | 5 | < 0.1% |
| 10.04 | 15 |
| Value | Count | Frequency (%) |
| 10.02 | 1 | |
| 10.08 | 1 | |
| 10.11 | 1 | |
| 10.16 | 1 | |
| 10.17 | 1 |
min_single_purchase_value
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 991 | 991 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.04384896 | 5.012402 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| 5-th percentile | 0.59 | 0.57 |
| Q1 | 2.57 | 2.53 |
| median | 5.04 | 5.01 |
| Q3 | 7.51 | 7.48 |
| 95-th percentile | 9.5 | 9.52 |
| Maximum | 10 | 10 |
| Range | 9.9 | 9.9 |
| Interquartile range (IQR) | 4.94 | 4.95 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2.855904644 | 2.856644035 |
| Coefficient of variation (CV) | 0.566215338 | 0.5699151894 |
| Kurtosis | -1.198193882 | -1.191228918 |
| Mean | 5.04384896 | 5.012402 |
| Median Absolute Deviation (MAD) | 2.47 | 2.48 |
| Skewness | 0.002415403507 | 0.01785919507 |
| Sum | 5043848.96 | 150372.06 |
| Variance | 8.156191335 | 8.160415144 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 4.66 | 1123 | 0.1% |
| 0.29 | 1112 | 0.1% |
| 3.05 | 1110 | 0.1% |
| 1.67 | 1101 | 0.1% |
| 4.67 | 1092 | 0.1% |
| 6.93 | 1092 | 0.1% |
| 6.14 | 1091 | 0.1% |
| 5.19 | 1091 | 0.1% |
| 5.31 | 1086 | 0.1% |
| 5.02 | 1086 | 0.1% |
| Other values (981) | 989016 |
| Value | Count | Frequency (%) |
| 3.88 | 53 | 0.2% |
| 9.99 | 49 | 0.2% |
| 2.27 | 49 | 0.2% |
| 2.65 | 47 | 0.2% |
| 3.84 | 47 | 0.2% |
| 1.72 | 47 | 0.2% |
| 1.74 | 46 | 0.2% |
| 3.25 | 46 | 0.2% |
| 0.34 | 45 | 0.1% |
| 9.92 | 45 | 0.1% |
| Other values (981) | 29526 |
| Value | Count | Frequency (%) |
| 0.1 | 491 | |
| 0.11 | 1041 | |
| 0.12 | 1011 | |
| 0.13 | 1044 | |
| 0.14 | 1013 |
| Value | Count | Frequency (%) |
| 0.1 | 8 | < 0.1% |
| 0.11 | 32 | |
| 0.12 | 32 | |
| 0.13 | 40 | |
| 0.14 | 37 |
product_name
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 9 | 9 |
| Median length | 9 | 9 |
| Mean length | 9 | 9 |
| Min length | 9 | 9 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Product D | Product D |
| 2nd row | Product C | Product A |
| 3rd row | Product B | Product B |
| 4th row | Product A | Product B |
| 5th row | Product C | Product A |
| Value | Count | Frequency (%) |
| product | 1000000 | |
| b | 250375 | 12.5% |
| c | 249957 | 12.5% |
| a | 249928 | 12.5% |
| d | 249740 | 12.5% |
| Value | Count | Frequency (%) |
| product | 30000 | |
| a | 7665 | 12.8% |
| b | 7481 | 12.5% |
| c | 7442 | 12.4% |
| d | 7412 | 12.4% |
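The shares of roughly 12.5% above are relative to the total number of word tokens rather than the number of rows. Each product_name value appears to split into two tokens (for example "Product B" into "product" and "b"), so the denominator is twice the row count. The character tables for this variable likewise use nine characters per value. This reading is inferred from the counts shown, and the helper name below is hypothetical.

```python
def share_percent(count: int, rows: int, units_per_row: int) -> float:
    """Percentage of `count` among rows * units_per_row tokens or characters."""
    return round(100 * count / (rows * units_per_row), 1)

# Word-level shares: two tokens per value ("product" plus one letter).
full_b = share_percent(250375, rows=1_000_000, units_per_row=2)
sample_b = share_percent(7481, rows=30_000, units_per_row=2)

# Character-level share: nine characters per value, e.g. "Product B".
full_char_b = share_percent(250375, rows=1_000_000, units_per_row=9)

print(full_b, sample_b, full_char_b)
```

With these denominators the computed shares agree with the 12.5% and 2.8% figures reported for "b" and "B".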
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| (space) | 1000000 | |
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| (space) | 30000 | |
| A | 7665 | 2.8% |
| B | 7481 | 2.8% |
| Other values (2) | 14854 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| (space) | 1000000 | |
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| (space) | 30000 | |
| A | 7665 | 2.8% |
| B | 7481 | 2.8% |
| Other values (2) | 14854 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| (space) | 1000000 | |
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| (space) | 30000 | |
| A | 7665 | 2.8% |
| B | 7481 | 2.8% |
| Other values (2) | 14854 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
| Value | Count | Frequency (%) |
| (unknown) | 270000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 1000000 | |
| r | 1000000 | |
| o | 1000000 | |
| d | 1000000 | |
| u | 1000000 | |
| c | 1000000 | |
| t | 1000000 | |
| (space) | 1000000 | |
| B | 250375 | 2.8% |
| C | 249957 | 2.8% |
| Other values (2) | 499668 |
| Value | Count | Frequency (%) |
| P | 30000 | |
| r | 30000 | |
| o | 30000 | |
| d | 30000 | |
| u | 30000 | |
| c | 30000 | |
| t | 30000 | |
| (space) | 30000 | |
| A | 7665 | 2.8% |
| B | 7481 | 2.8% |
| Other values (2) | 14854 |
product_brand
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Brand Y | Brand Y |
| 2nd row | Brand X | Brand X |
| 3rd row | Brand X | Brand Z |
| 4th row | Brand Z | Brand Y |
| 5th row | Brand X | Brand X |
| Value | Count | Frequency (%) |
| brand | 1000000 | |
| y | 333775 | 16.7% |
| z | 333608 | 16.7% |
| x | 332617 | 16.6% |
| Value | Count | Frequency (%) |
| brand | 30000 | |
| y | 10245 | 17.1% |
| x | 9890 | 16.5% |
| z | 9865 | 16.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| (space) | 1000000 | |
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| (space) | 30000 | |
| Y | 10245 | 4.9% |
| X | 9890 | 4.7% |
| Z | 9865 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| (space) | 1000000 | |
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| (space) | 30000 | |
| Y | 10245 | 4.9% |
| X | 9890 | 4.7% |
| Z | 9865 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| (space) | 1000000 | |
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| (space) | 30000 | |
| Y | 10245 | 4.9% |
| X | 9890 | 4.7% |
| Z | 9865 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 1000000 | |
| r | 1000000 | |
| a | 1000000 | |
| n | 1000000 | |
| d | 1000000 | |
| (space) | 1000000 | |
| Y | 333775 | 4.8% |
| Z | 333608 | 4.8% |
| X | 332617 | 4.8% |
| Value | Count | Frequency (%) |
| B | 30000 | |
| r | 30000 | |
| a | 30000 | |
| n | 30000 | |
| d | 30000 | |
| (space) | 30000 | |
| Y | 10245 | 4.9% |
| X | 9890 | 4.7% |
| Z | 9865 | 4.7% |
product_rating
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 41 | 41 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 2.9990096 | 3.000233333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 5 | 5 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.2 | 1.2 |
| Q1 | 2 | 2 |
| median | 3 | 3 |
| Q3 | 4 | 4 |
| 95-th percentile | 4.8 | 4.8 |
| Maximum | 5 | 5 |
| Range | 4 | 4 |
| Interquartile range (IQR) | 2 | 2 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 1.154800603 | 1.155675473 |
| Coefficient of variation (CV) | 0.3850606557 | 0.385195198 |
| Kurtosis | -1.196293362 | -1.194473672 |
| Mean | 2.9990096 | 3.000233333 |
| Median Absolute Deviation (MAD) | 1 | 1 |
| Skewness | -0.0005343871929 | -0.0009974057543 |
| Sum | 2999009.6 | 90007 |
| Variance | 1.333564433 | 1.335585798 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2.9 | 25242 | 2.5% |
| 3.4 | 25229 | 2.5% |
| 2.6 | 25229 | 2.5% |
| 1.3 | 25194 | 2.5% |
| 3 | 25181 | 2.5% |
| 4.7 | 25166 | 2.5% |
| 4.3 | 25159 | 2.5% |
| 4.1 | 25146 | 2.5% |
| 1.6 | 25141 | 2.5% |
| 4 | 25134 | 2.5% |
| Other values (31) | 748179 |
| Value | Count | Frequency (%) |
| 4.9 | 809 | 2.7% |
| 3.4 | 806 | 2.7% |
| 4.7 | 790 | 2.6% |
| 1.5 | 790 | 2.6% |
| 3 | 785 | 2.6% |
| 3.7 | 780 | 2.6% |
| 3.6 | 779 | 2.6% |
| 2.9 | 775 | 2.6% |
| 1.2 | 766 | 2.6% |
| 4.5 | 766 | 2.6% |
| Other values (31) | 22154 |
| Value | Count | Frequency (%) |
| 1 | 12653 | |
| 1.1 | 24871 | |
| 1.2 | 25095 | |
| 1.3 | 25194 | |
| 1.4 | 24848 |
| Value | Count | Frequency (%) |
| 1 | 370 | |
| 1.1 | 741 | |
| 1.2 | 766 | |
| 1.3 | 756 | |
| 1.4 | 760 |
product_review_count
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 1000 | 1000 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.235198 | 499.7586667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 999 | 999 |
| Zeros | 987 | 26 |
| Zeros (%) | 0.1% | 0.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 50 | 52 |
| Q1 | 250 | 250 |
| median | 499 | 499 |
| Q3 | 749 | 751 |
| 95-th percentile | 949 | 952.05 |
| Maximum | 999 | 999 |
| Range | 999 | 999 |
| Interquartile range (IQR) | 499 | 501 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288.4461496 | 288.5432539 |
| Coefficient of variation (CV) | 0.5777760678 | 0.5773651828 |
| Kurtosis | -1.19905271 | -1.1968163 |
| Mean | 499.235198 | 499.7586667 |
| Median Absolute Deviation (MAD) | 250 | 250 |
| Skewness | 0.001190014496 | 0.008804364011 |
| Sum | 499235198 | 14992760 |
| Variance | 83201.18122 | 83257.2094 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 974 | 1095 | 0.1% |
| 56 | 1089 | 0.1% |
| 769 | 1089 | 0.1% |
| 725 | 1088 | 0.1% |
| 683 | 1085 | 0.1% |
| 229 | 1082 | 0.1% |
| 501 | 1079 | 0.1% |
| 937 | 1074 | 0.1% |
| 384 | 1073 | 0.1% |
| 497 | 1072 | 0.1% |
| Other values (990) | 989174 |
| Value | Count | Frequency (%) |
| 333 | 51 | 0.2% |
| 672 | 48 | 0.2% |
| 769 | 47 | 0.2% |
| 580 | 47 | 0.2% |
| 220 | 46 | 0.2% |
| 833 | 45 | 0.1% |
| 559 | 44 | 0.1% |
| 638 | 44 | 0.1% |
| 869 | 44 | 0.1% |
| 119 | 44 | 0.1% |
| Other values (990) | 29540 |
| Value | Count | Frequency (%) |
| 0 | 987 | |
| 1 | 999 | |
| 2 | 1006 | |
| 3 | 1006 | |
| 4 | 1027 |
| Value | Count | Frequency (%) |
| 0 | 26 | |
| 1 | 34 | |
| 2 | 26 | |
| 3 | 41 | |
| 4 | 18 |
product_stock
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.515129 | 49.78733333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10174 | 289 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 5 | 5 |
| Q1 | 25 | 25 |
| median | 49 | 50 |
| Q3 | 75 | 75 |
| 95-th percentile | 95 | 95 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 28.87664529 | 28.98851929 |
| Coefficient of variation (CV) | 0.5831883279 | 0.5822468759 |
| Kurtosis | -1.200520476 | -1.209739776 |
| Mean | 49.515129 | 49.78733333 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | 0.0006383736941 | -0.01261429473 |
| Sum | 49515129 | 1493620 |
| Variance | 833.860643 | 840.3342507 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 89 | 10261 | 1.0% |
| 70 | 10245 | 1.0% |
| 60 | 10187 | 1.0% |
| 23 | 10175 | 1.0% |
| 0 | 10174 | 1.0% |
| 54 | 10171 | 1.0% |
| 44 | 10148 | 1.0% |
| 96 | 10147 | 1.0% |
| 32 | 10138 | 1.0% |
| 77 | 10136 | 1.0% |
| Other values (90) | 898218 |
| Value | Count | Frequency (%) |
| 89 | 344 | 1.1% |
| 52 | 336 | 1.1% |
| 59 | 333 | 1.1% |
| 83 | 330 | 1.1% |
| 78 | 329 | 1.1% |
| 70 | 329 | 1.1% |
| 98 | 328 | 1.1% |
| 12 | 326 | 1.1% |
| 82 | 324 | 1.1% |
| 27 | 324 | 1.1% |
| Other values (90) | 26697 |
| Value | Count | Frequency (%) |
| 0 | 10174 | |
| 1 | 9857 | |
| 2 | 9895 | |
| 3 | 10030 | |
| 4 | 9924 |
| Value | Count | Frequency (%) |
| 0 | 289 | |
| 1 | 274 | |
| 2 | 308 | |
| 3 | 316 | |
| 4 | 297 |
product_return_rate
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 51 | 51 |
| Distinct (%) | < 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.25013741 | 0.2512543333 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 0.5 | 0.5 |
| Zeros | 9960 | 315 |
| Zeros (%) | 1.0% | 1.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.03 | 0.02 |
| Q1 | 0.13 | 0.13 |
| median | 0.25 | 0.25 |
| Q3 | 0.38 | 0.38 |
| 95-th percentile | 0.48 | 0.48 |
| Maximum | 0.5 | 0.5 |
| Range | 0.5 | 0.5 |
| Interquartile range (IQR) | 0.25 | 0.25 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 0.1444084896 | 0.1451581402 |
| Coefficient of variation (CV) | 0.5773166421 | 0.5777338775 |
| Kurtosis | -1.197824771 | -1.207624816 |
| Mean | 0.25013741 | 0.2512543333 |
| Median Absolute Deviation (MAD) | 0.13 | 0.13 |
| Skewness | -0.0005165569762 | -0.007170390778 |
| Sum | 250137.41 | 7537.63 |
| Variance | 0.02085381187 | 0.02107088568 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 0.43 | 20287 | 2.0% |
| 0.38 | 20282 | 2.0% |
| 0.03 | 20242 | 2.0% |
| 0.46 | 20215 | 2.0% |
| 0.4 | 20209 | 2.0% |
| 0.14 | 20164 | 2.0% |
| 0.45 | 20148 | 2.0% |
| 0.16 | 20140 | 2.0% |
| 0.06 | 20135 | 2.0% |
| 0.29 | 20118 | 2.0% |
| Other values (41) | 798060 |
| Value | Count | Frequency (%) |
| 0.4 | 676 | 2.3% |
| 0.46 | 668 | 2.2% |
| 0.45 | 633 | 2.1% |
| 0.27 | 632 | 2.1% |
| 0.21 | 631 | 2.1% |
| 0.49 | 631 | 2.1% |
| 0.43 | 630 | 2.1% |
| 0.14 | 628 | 2.1% |
| 0.19 | 626 | 2.1% |
| 0.48 | 626 | 2.1% |
| Other values (41) | 23619 |
| Value | Count | Frequency (%) |
| 0 | 9960 | |
| 0.01 | 19921 | |
| 0.02 | 19994 | |
| 0.03 | 20242 | |
| 0.04 | 19825 |
| Value | Count | Frequency (%) |
| 0 | 315 | |
| 0.01 | 594 | |
| 0.02 | 610 | |
| 0.03 | 609 | |
| 0.04 | 604 |
product_size
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 5 | 5 |
| Mean length | 5.333501 | 5.3283 |
| Min length | 5 | 5 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Small | Small |
| 2nd row | Medium | Medium |
| 3rd row | Medium | Small |
| 4th row | Large | Small |
| 5th row | Small | Medium |
| Value | Count | Frequency (%) |
| large | 333964 | |
| medium | 333501 | |
| small | 332535 |
| Value | Count | Frequency (%) |
| large | 10123 | |
| small | 10028 | |
| medium | 9849 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 667465 | |
| a | 666499 | |
| m | 666036 | |
| l | 665070 | |
| g | 333964 | |
| r | 333964 | |
| L | 333964 | |
| M | 333501 | |
| i | 333501 | |
| d | 333501 | |
| Other values (2) | 666036 |
| Value | Count | Frequency (%) |
| a | 20151 | |
| l | 20056 | |
| e | 19972 | |
| m | 19877 | |
| g | 10123 | |
| r | 10123 | |
| L | 10123 | |
| S | 10028 | |
| M | 9849 | |
| d | 9849 | |
| Other values (2) | 19698 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159849 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5333501 |
| Value | Count | Frequency (%) |
| (unknown) | 159849 |
product_weight
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 991 | 991 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 5.05437238 | 5.061463 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| Maximum | 10 | 10 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| 5-th percentile | 0.6 | 0.62 |
| Q1 | 2.58 | 2.58 |
| median | 5.06 | 5.07 |
| Q3 | 7.53 | 7.53 |
| 95-th percentile | 9.5 | 9.5 |
| Maximum | 10 | 10 |
| Range | 9.9 | 9.9 |
| Interquartile range (IQR) | 4.95 | 4.95 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 2.857848487 | 2.855694378 |
| Coefficient of variation (CV) | 0.56542104 | 0.5642033496 |
| Kurtosis | -1.200012392 | -1.196524814 |
| Mean | 5.05437238 | 5.061463 |
| Median Absolute Deviation (MAD) | 2.47 | 2.47 |
| Skewness | -0.001975515497 | -0.0007373572724 |
| Sum | 5054372.38 | 151843.89 |
| Variance | 8.167297977 | 8.154990383 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 2.51 | 1094 | 0.1% |
| 7.79 | 1092 | 0.1% |
| 3.96 | 1089 | 0.1% |
| 3.55 | 1089 | 0.1% |
| 1.61 | 1088 | 0.1% |
| 5.24 | 1088 | 0.1% |
| 1.74 | 1087 | 0.1% |
| 4.66 | 1085 | 0.1% |
| 3.04 | 1082 | 0.1% |
| 1.22 | 1081 | 0.1% |
| Other values (981) | 989125 |
| Value | Count | Frequency (%) |
| 8.14 | 49 | 0.2% |
| 1.76 | 47 | 0.2% |
| 5.85 | 46 | 0.2% |
| 9.93 | 45 | 0.1% |
| 2.2 | 45 | 0.1% |
| 8.5 | 45 | 0.1% |
| 5.21 | 45 | 0.1% |
| 8.96 | 44 | 0.1% |
| 1.45 | 44 | 0.1% |
| 5.52 | 44 | 0.1% |
| Other values (981) | 29546 |
| Value | Count | Frequency (%) |
| 0.1 | 506 | |
| 0.11 | 1031 | |
| 0.12 | 996 | |
| 0.13 | 1001 | |
| 0.14 | 1007 |
| Value | Count | Frequency (%) |
| 0.1 | 13 | < 0.1% |
| 0.11 | 19 | |
| 0.12 | 34 | |
| 0.13 | 27 | |
| 0.14 | 32 |
product_color
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 5 | 5 |
| Median length | 5 | 5 |
| Mean length | 4.399397 | 4.4086 |
| Min length | 3 | 3 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Red | Red |
| 2nd row | Blue | Green |
| 3rd row | Green | White |
| 4th row | Blue | White |
| 5th row | Red | White |
| Value | Count | Frequency (%) |
| blue | 200671 | |
| green | 200202 | |
| red | 199966 | |
| black | 199704 | |
| white | 199457 |
| Value | Count | Frequency (%) |
| green | 6117 | |
| white | 6051 | |
| blue | 5980 | |
| black | 5971 | |
| red | 5881 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1000498 | |
| B | 400375 | 9.1% |
| l | 400375 | 9.1% |
| u | 200671 | 4.6% |
| G | 200202 | 4.6% |
| r | 200202 | 4.6% |
| n | 200202 | 4.6% |
| R | 199966 | 4.5% |
| d | 199966 | 4.5% |
| a | 199704 | 4.5% |
| Other values (6) | 1197236 |
| Value | Count | Frequency (%) |
| e | 30146 | |
| B | 11951 | 9.0% |
| l | 11951 | 9.0% |
| G | 6117 | 4.6% |
| r | 6117 | 4.6% |
| n | 6117 | 4.6% |
| W | 6051 | 4.6% |
| h | 6051 | 4.6% |
| t | 6051 | 4.6% |
| i | 6051 | 4.6% |
| Other values (6) | 35655 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 132258 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 132258 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4399397 |
| Value | Count | Frequency (%) |
| (unknown) | 132258 |
product_material
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 5 | 5 |
| Mean length | 5.25087 | 5.254966667 |
| Min length | 4 | 4 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | Metal | Metal |
| 2nd row | Metal | Plastic |
| 3rd row | Plastic | Metal |
| 4th row | Wood | Metal |
| 5th row | Metal | Metal |
| Value | Count | Frequency (%) |
| plastic | 250483 | |
| wood | 250096 | |
| metal | 249896 | |
| glass | 249525 |
| Value | Count | Frequency (%) |
| glass | 7732 | |
| plastic | 7488 | |
| metal | 7453 | |
| wood | 7327 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 749904 | |
| a | 749904 | |
| s | 749533 | |
| t | 500379 | |
| o | 500192 | |
| P | 250483 | 4.8% |
| i | 250483 | 4.8% |
| c | 250483 | 4.8% |
| W | 250096 | 4.8% |
| d | 250096 | 4.8% |
| Other values (3) | 749317 |
| Value | Count | Frequency (%) |
| s | 22952 | |
| a | 22673 | |
| l | 22673 | |
| t | 14941 | |
| o | 14654 | |
| G | 7732 | 4.9% |
| P | 7488 | 4.7% |
| c | 7488 | 4.7% |
| i | 7488 | 4.7% |
| M | 7453 | 4.7% |
| Other values (3) | 22107 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157649 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157649 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5250870 |
| Value | Count | Frequency (%) |
| (unknown) | 157649 |
product_manufacture_date
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 992037 | 29991 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 984126 | 29982 |
| Unique (%) | 98.4% | 99.9% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | 2019-08-04 01:47:01 | 2019-08-04 01:47:01 |
| 2nd row | 2019-10-23 19:59:17 | 2019-09-19 10:14:11 |
| 3rd row | 2018-05-12 08:00:29 | 2019-02-04 10:47:41 |
| 4th row | 2019-11-15 16:17:29 | 2019-01-04 09:40:43 |
| 5th row | 2019-08-27 02:58:19 | 2019-05-27 11:21:51 |
| Value | Count | Frequency (%) |
| 2018-04-10 | 1514 | 0.1% |
| 2019-03-19 | 1490 | 0.1% |
| 2018-02-26 | 1471 | 0.1% |
| 2018-06-18 | 1467 | 0.1% |
| 2018-09-24 | 1462 | 0.1% |
| 2019-01-25 | 1457 | 0.1% |
| 2019-01-30 | 1456 | 0.1% |
| 2018-04-28 | 1454 | 0.1% |
| 2019-07-04 | 1453 | 0.1% |
| 2019-01-09 | 1453 | 0.1% |
| Other values (87119) | 1985323 |
| Value | Count | Frequency (%) |
| 2019-10-10 | 66 | 0.1% |
| 2018-05-25 | 60 | 0.1% |
| 2018-03-20 | 57 | 0.1% |
| 2018-02-25 | 57 | 0.1% |
| 2018-08-27 | 57 | 0.1% |
| 2018-01-23 | 56 | 0.1% |
| 2018-01-06 | 55 | 0.1% |
| 2019-11-21 | 55 | 0.1% |
| 2018-06-16 | 55 | 0.1% |
| 2019-05-12 | 55 | 0.1% |
| Other values (26164) | 59427 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3297851 | |
| 1 | 2942646 | |
| 2 | 2411227 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 8 | 967248 | 5.1% |
| 9 | 961589 | 5.1% |
| 3 | 891404 | 4.7% |
| 5 | 799916 | 4.2% |
| Other values (3) | 1728119 |
| Value | Count | Frequency (%) |
| 0 | 98980 | |
| 1 | 88196 | |
| 2 | 72387 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 9 | 28977 | 5.1% |
| 8 | 28940 | 5.1% |
| 3 | 26720 | 4.7% |
| 5 | 24008 | 4.2% |
| Other values (3) | 51792 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
product_expiry_date
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 992042 | 29994 |
| Distinct (%) | 99.2% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 984121 | 29988 |
| Unique (%) | 98.4% | > 99.9% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | 2022-05-28 14:54:02 | 2022-05-28 14:54:02 |
| 2nd row | 2022-12-19 08:04:41 | 2022-12-02 10:27:26 |
| 3rd row | 2023-02-01 12:15:07 | 2022-11-25 00:36:25 |
| 4th row | 2023-02-05 11:46:57 | 2023-04-21 18:13:36 |
| 5th row | 2023-10-05 08:13:07 | 2023-03-09 01:20:13 |
| Value | Count | Frequency (%) |
| 2022-12-22 | 1476 | 0.1% |
| 2022-06-08 | 1475 | 0.1% |
| 2022-01-28 | 1473 | 0.1% |
| 2023-03-13 | 1472 | 0.1% |
| 2022-10-06 | 1468 | 0.1% |
| 2022-06-27 | 1468 | 0.1% |
| 2023-07-08 | 1457 | 0.1% |
| 2023-07-23 | 1457 | 0.1% |
| 2023-01-11 | 1452 | 0.1% |
| 2022-04-17 | 1450 | 0.1% |
| Other values (87119) | 1985352 |
| Value | Count | Frequency (%) |
| 2022-09-23 | 63 | 0.1% |
| 2023-09-19 | 60 | 0.1% |
| 2022-07-26 | 59 | 0.1% |
| 2022-02-19 | 59 | 0.1% |
| 2022-11-29 | 57 | 0.1% |
| 2023-10-05 | 56 | 0.1% |
| 2023-12-10 | 56 | 0.1% |
| 2022-12-18 | 56 | 0.1% |
| 2022-08-25 | 55 | 0.1% |
| 2022-07-19 | 55 | 0.1% |
| Other values (26049) | 59424 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3911569 | |
| 0 | 3298912 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1939371 | |
| 3 | 1392537 | 7.3% |
| (space) | 1000000 | 5.3% |
| 5 | 800253 | 4.2% |
| 4 | 798405 | 4.2% |
| 8 | 467536 | 2.5% |
| Other values (3) | 1391417 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 117442 | |
| 0 | 99034 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58229 | |
| 3 | 41580 | 7.3% |
| (space) | 30000 | 5.3% |
| 5 | 24249 | 4.3% |
| 4 | 23712 | 4.2% |
| 7 | 14077 | 2.5% |
| Other values (3) | 41677 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
product_shelf_life
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 365 | 365 |
| Distinct (%) | < 0.1% | 1.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 181.876207 | 181.6946667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 364 | 364 |
| Zeros | 2713 | 63 |
| Zeros (%) | 0.3% | 0.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 18 | 18 |
| Q1 | 91 | 92 |
| median | 182 | 181 |
| Q3 | 273 | 272 |
| 95-th percentile | 346 | 346 |
| Maximum | 364 | 364 |
| Range | 364 | 364 |
| Interquartile range (IQR) | 182 | 180 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 105.2288552 | 104.9073453 |
| Coefficient of variation (CV) | 0.5785740585 | 0.5773826342 |
| Kurtosis | -1.198082782 | -1.187975675 |
| Mean | 181.876207 | 181.6946667 |
| Median Absolute Deviation (MAD) | 91 | 90 |
| Skewness | 0.0006229204449 | 0.002701780176 |
| Sum | 181876207 | 5450840 |
| Variance | 11073.11197 | 11005.55109 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 87 | 2893 | 0.3% |
| 272 | 2874 | 0.3% |
| 70 | 2870 | 0.3% |
| 250 | 2870 | 0.3% |
| 210 | 2862 | 0.3% |
| 224 | 2859 | 0.3% |
| 238 | 2857 | 0.3% |
| 33 | 2848 | 0.3% |
| 297 | 2847 | 0.3% |
| 171 | 2845 | 0.3% |
| Other values (355) | 971375 |
| Value | Count | Frequency (%) |
| 125 | 111 | 0.4% |
| 5 | 110 | 0.4% |
| 248 | 109 | 0.4% |
| 273 | 107 | 0.4% |
| 322 | 107 | 0.4% |
| 285 | 106 | 0.4% |
| 82 | 102 | 0.3% |
| 93 | 102 | 0.3% |
| 306 | 101 | 0.3% |
| 291 | 101 | 0.3% |
| Other values (355) | 28944 |
| Value | Count | Frequency (%) |
| 0 | 2713 | |
| 1 | 2788 | |
| 2 | 2776 | |
| 3 | 2725 | |
| 4 | 2788 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 1 | 76 | |
| 2 | 77 | |
| 3 | 94 | |
| 4 | 78 |
promotion_id
Real number (ℝ)
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 999 | 999 |
| Distinct (%) | 0.1% | 3.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 499.920037 | 498.2946667 |
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 999 | 999 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 50 | 51.95 |
| Q1 | 250 | 249 |
| median | 500 | 499 |
| Q3 | 750 | 746 |
| 95-th percentile | 949 | 949 |
| Maximum | 999 | 999 |
| Range | 998 | 998 |
| Interquartile range (IQR) | 500 | 497 |
Descriptive statistics
| Full Dataset | Systematic Sample | |
|---|---|---|
| Standard deviation | 288.4530565 | 288.2730145 |
| Coefficient of variation (CV) | 0.57699839 | 0.578519165 |
| Kurtosis | -1.200677574 | -1.203212649 |
| Mean | 499.920037 | 498.2946667 |
| Median Absolute Deviation (MAD) | 250 | 248 |
| Skewness | -0.0008935044332 | 0.008814299235 |
| Sum | 499920037 | 14948840 |
| Variance | 83205.16579 | 83101.33088 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 52 | 1092 | 0.1% |
| 94 | 1082 | 0.1% |
| 374 | 1079 | 0.1% |
| 117 | 1077 | 0.1% |
| 29 | 1075 | 0.1% |
| 603 | 1075 | 0.1% |
| 512 | 1073 | 0.1% |
| 949 | 1073 | 0.1% |
| 885 | 1073 | 0.1% |
| 51 | 1070 | 0.1% |
| Other values (989) | 989231 |
| Value | Count | Frequency (%) |
| 462 | 49 | 0.2% |
| 396 | 48 | 0.2% |
| 793 | 48 | 0.2% |
| 666 | 47 | 0.2% |
| 311 | 46 | 0.2% |
| 306 | 46 | 0.2% |
| 740 | 45 | 0.1% |
| 670 | 45 | 0.1% |
| 361 | 45 | 0.1% |
| 282 | 44 | 0.1% |
| Other values (989) | 29537 |
| Value | Count | Frequency (%) |
| 1 | 1033 | |
| 2 | 995 | |
| 3 | 1036 | |
| 4 | 1024 | |
| 5 | 992 |
| Value | Count | Frequency (%) |
| 1 | 40 | |
| 2 | 31 | |
| 3 | 30 | |
| 4 | 42 | |
| 5 | 37 |
promotion_type
Text
| Full Dataset | Systematic Sample | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| Full Dataset | Systematic Sample | |
|---|---|---|
| Max length | 20 | 20 |
| Median length | 10 | 10 |
| Mean length | 12.334064 | 12.36263333 |
| Min length | 7 | 7 |
Unique
| Full Dataset | Systematic Sample | |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| Full Dataset | Systematic Sample | |
|---|---|---|
| 1st row | 20% Off | 20% Off |
| 2nd row | Flash Sale | Flash Sale |
| 3rd row | Flash Sale | Flash Sale |
| 4th row | Buy One Get One Free | Buy One Get One Free |
| 5th row | Flash Sale | Flash Sale |
| Value | Count | Frequency (%) |
| one | 667040 | |
| 20 | 333712 | |
| off | 333712 | |
| buy | 333520 | |
| get | 333520 | |
| free | 333520 | |
| flash | 332768 | |
| sale | 332768 |
| Value | Count | Frequency (%) |
| one | 20174 | |
| buy | 10087 | |
| get | 10087 | |
| free | 10087 | |
| 20 | 9997 | |
| off | 9997 | |
| flash | 9916 | |
| sale | 9916 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2000560 | |
| e | 2000368 | |
| O | 1000752 | 8.1% |
| f | 667424 | 5.4% |
| n | 667040 | 5.4% |
| F | 666288 | 5.4% |
| a | 665536 | 5.4% |
| l | 665536 | 5.4% |
| % | 333712 | 2.7% |
| 0 | 333712 | 2.7% |
| Other values (10) | 3333136 |
| Value | Count | Frequency (%) |
| e | 60351 | |
| (space) | 60261 | |
| O | 30171 | 8.1% |
| n | 20174 | 5.4% |
| F | 20003 | 5.4% |
| f | 19994 | 5.4% |
| a | 19832 | 5.3% |
| l | 19832 | 5.3% |
| y | 10087 | 2.7% |
| u | 10087 | 2.7% |
| Other values (10) | 100087 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370879 |
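The character totals for `promotion_type` can be cross-checked against the word counts. The following is a minimal sketch, assuming every row holds exactly one of the three labels shown in the Sample table and that the number of rows per label equals the `buy`, `20`, and `flash` word counts of the full dataset. The variable names are illustrative.

```python
from collections import Counter

# Rows per label, inferred from the full-dataset word counts
# (assumption: one label per row, so the counts sum to 1,000,000).
label_rows = {
    "Buy One Get One Free": 333520,
    "20% Off": 333712,
    "Flash Sale": 332768,
}

# Weight each label's character counts by the number of rows carrying it.
char_counts = Counter()
for label, rows in label_rows.items():
    for ch, n in Counter(label).items():
        char_counts[ch] += n * rows

total_chars = sum(char_counts.values())
print(total_chars)       # total characters across all rows
print(char_counts[" "])  # occurrences of the space character
print(char_counts["e"])  # occurrences of "e"
```

Under these assumptions the totals agree with the 12334064 characters reported for the full dataset, and the 2000560 count corresponds to the space character.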
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370879 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12334064 |
| Value | Count | Frequency (%) |
| (unknown) | 370879 |
promotion_start_date
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 984258 | 29984 |
| Distinct (%) | 98.4% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 968681 | 29968 |
| Unique (%) | 96.9% | 99.9% |
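The gap between Distinct and Unique comes from repeated values. Assuming the usual ydata-profiling definitions, Distinct counts every different value, while Unique counts only the values that occur exactly once. The following is a minimal sketch of that distinction, using a small hypothetical list of timestamps rather than the real column.

```python
from collections import Counter

# Hypothetical values for illustration only; not taken from the dataset.
values = [
    "2021-07-14 14:28:42",
    "2021-09-23 04:26:09",
    "2021-07-14 14:28:42",  # repeats the first value
    "2021-06-13 12:31:15",
]

counts = Counter(values)
n_distinct = len(counts)                              # number of different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n_distinct, n_unique)  # 3 2
```

Read this way, the full dataset contains 984258 - 968681 = 15577 distinct timestamps that occur more than once.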
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | 2021-07-14 14:28:42 | 2021-07-14 14:28:42 |
| 2nd row | 2021-09-23 04:26:09 | 2021-06-04 11:16:45 |
| 3rd row | 2021-06-13 12:31:15 | 2021-12-31 18:17:24 |
| 4th row | 2021-05-23 05:42:48 | 2021-02-14 04:10:44 |
| 5th row | 2021-04-19 04:55:32 | 2021-09-23 11:20:57 |
| Value | Count | Frequency (%) |
| 2021-03-05 | 2885 | 0.1% |
| 2021-02-07 | 2874 | 0.1% |
| 2021-06-23 | 2871 | 0.1% |
| 2021-05-15 | 2867 | 0.1% |
| 2021-08-27 | 2863 | 0.1% |
| 2021-11-04 | 2862 | 0.1% |
| 2021-03-25 | 2858 | 0.1% |
| 2021-09-06 | 2854 | 0.1% |
| 2021-12-21 | 2851 | 0.1% |
| 2021-08-06 | 2850 | 0.1% |
| Other values (86754) | 1971365 |
| Value | Count | Frequency (%) |
| 2021-09-08 | 116 | 0.2% |
| 2021-06-29 | 108 | 0.2% |
| 2021-09-21 | 108 | 0.2% |
| 2021-07-28 | 106 | 0.2% |
| 2021-04-21 | 106 | 0.2% |
| 2021-09-20 | 105 | 0.2% |
| 2021-03-04 | 103 | 0.2% |
| 2021-09-23 | 102 | 0.2% |
| 2021-03-14 | 101 | 0.2% |
| 2021-11-14 | 101 | 0.2% |
| Other values (25798) | 58944 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3411867 | |
| 0 | 3300257 | |
| 1 | 2938965 | |
| - | 2000000 | |
| : | 2000000 | |
| (space) | 1000000 | 5.3% |
| 3 | 890896 | 4.7% |
| 5 | 800172 | 4.2% |
| 4 | 796605 | 4.2% |
| 7 | 468643 | 2.5% |
| Other values (3) | 1392595 |
| Value | Count | Frequency (%) |
| 2 | 102036 | |
| 0 | 99116 | |
| 1 | 88203 | |
| - | 60000 | |
| : | 60000 | |
| (space) | 30000 | 5.3% |
| 3 | 26783 | 4.7% |
| 4 | 24136 | 4.2% |
| 5 | 23982 | 4.2% |
| 8 | 13996 | 2.5% |
| Other values (3) | 41748 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
promotion_end_date
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 984252 | 29987 |
| Distinct (%) | 98.4% | > 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 19 | 19 |
| Mean length | 19 | 19 |
| Min length | 19 | 19 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 968676 | 29974 |
| Unique (%) | 96.9% | 99.9% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | 2022-12-30 13:04:13 | 2022-12-30 13:04:13 |
| 2nd row | 2022-09-13 03:16:26 | 2022-10-18 02:50:42 |
| 3rd row | 2022-03-13 00:53:35 | 2022-06-01 09:36:45 |
| 4th row | 2022-02-06 00:42:30 | 2022-10-04 11:53:46 |
| 5th row | 2022-12-04 13:07:09 | 2022-11-26 17:32:39 |
| Value | Count | Frequency (%) |
| 2022-08-06 | 2905 | 0.1% |
| 2022-03-08 | 2896 | 0.1% |
| 2022-09-28 | 2874 | 0.1% |
| 2022-02-22 | 2873 | 0.1% |
| 2022-09-16 | 2872 | 0.1% |
| 2022-06-10 | 2865 | 0.1% |
| 2022-07-13 | 2858 | 0.1% |
| 2022-12-15 | 2854 | 0.1% |
| 2022-12-31 | 2847 | 0.1% |
| 2022-08-25 | 2842 | 0.1% |
| Other values (86755) | 1971314 |
| Value | Count | Frequency (%) |
| 2022-06-16 | 112 | 0.2% |
| 2022-08-30 | 106 | 0.2% |
| 2022-02-19 | 105 | 0.2% |
| 2022-11-28 | 104 | 0.2% |
| 2022-08-26 | 103 | 0.2% |
| 2022-01-30 | 102 | 0.2% |
| 2022-04-08 | 102 | 0.2% |
| 2022-06-04 | 102 | 0.2% |
| 2022-12-01 | 101 | 0.2% |
| 2022-01-15 | 101 | 0.2% |
| Other values (25716) | 58962 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4411397 | |
| 0 | 3299242 | |
| - | 2000000 | |
| : | 2000000 | |
| 1 | 1940750 | |
| (space) | 1000000 | 5.3% |
| 3 | 890892 | 4.7% |
| 5 | 800852 | 4.2% |
| 4 | 797493 | 4.2% |
| 8 | 467852 | 2.5% |
| Other values (3) | 1391522 | 7.3% |
| Value | Count | Frequency (%) |
| 2 | 132318 | |
| 0 | 98859 | |
| - | 60000 | |
| : | 60000 | |
| 1 | 58509 | |
| (space) | 30000 | 5.3% |
| 3 | 26731 | 4.7% |
| 5 | 24132 | 4.2% |
| 4 | 23830 | 4.2% |
| 8 | 14042 | 2.5% |
| Other values (3) | 41579 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19000000 |
| Value | Count | Frequency (%) |
| (unknown) | 570000 |
promotion_effectiveness
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.333407 | 4.339933333 |
| Min length | 3 | 3 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | High | High |
| 2nd row | Low | Medium |
| 3rd row | Low | High |
| 4th row | High | High |
| 5th row | Medium | High |
| Value | Count | Frequency (%) |
| high | 333660 | |
| medium | 333249 | |
| low | 333091 |
| Value | Count | Frequency (%) |
| medium | 10112 | |
| low | 10026 | |
| high | 9862 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666909 | |
| H | 333660 | |
| g | 333660 | |
| h | 333660 | |
| M | 333249 | |
| e | 333249 | |
| d | 333249 | |
| u | 333249 | |
| m | 333249 | |
| L | 333091 | |
| Other values (2) | 666182 |
| Value | Count | Frequency (%) |
| i | 19974 | |
| M | 10112 | |
| e | 10112 | |
| d | 10112 | |
| u | 10112 | |
| m | 10112 | |
| L | 10026 | |
| o | 10026 | |
| w | 10026 | |
| H | 9862 | |
| Other values (2) | 19724 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4333407 |
| Value | Count | Frequency (%) |
| (unknown) | 130198 |
promotion_channel
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 12 | 12 |
| Median length | 8 | 8 |
| Mean length | 8.665428 | 8.6592 |
| Min length | 6 | 6 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Online | Online |
| 2nd row | Social Media | In-store |
| 3rd row | Online | In-store |
| 4th row | Social Media | Social Media |
| 5th row | Online | Social Media |
| Value | Count | Frequency (%) |
| online | 333694 | |
| social | 333204 | |
| media | 333204 | |
| in-store | 333102 |
| Value | Count | Frequency (%) |
| online | 10026 | |
| in-store | 10017 | |
| social | 9957 | |
| media | 9957 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1000490 | |
| i | 1000102 | |
| e | 1000000 | |
| l | 666898 | 7.7% |
| a | 666408 | 7.7% |
| o | 666306 | 7.7% |
| O | 333694 | 3.9% |
| S | 333204 | 3.8% |
| c | 333204 | 3.8% |
| (space) | 333204 | 3.8% |
| Other values (7) | 2331918 |
| Value | Count | Frequency (%) |
| n | 30069 | |
| e | 30000 | |
| i | 29940 | |
| l | 19983 | 7.7% |
| o | 19974 | 7.7% |
| a | 19914 | 7.7% |
| O | 10026 | 3.9% |
| I | 10017 | 3.9% |
| - | 10017 | 3.9% |
| t | 10017 | 3.9% |
| Other values (7) | 69819 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 259776 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 259776 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8665428 |
| Value | Count | Frequency (%) |
| (unknown) | 259776 |
promotion_target_audience
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 19 | 19 |
| Median length | 13 | 19 |
| Mean length | 15.999292 | 16.0132 |
| Min length | 13 | 13 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | New Customers | New Customers |
| 2nd row | New Customers | Returning Customers |
| 3rd row | New Customers | Returning Customers |
| 4th row | Returning Customers | Returning Customers |
| 5th row | New Customers | New Customers |
| Value | Count | Frequency (%) |
| customers | 1000000 | |
| new | 500118 | |
| returning | 499882 |
| Value | Count | Frequency (%) |
| customers | 30000 | |
| returning | 15066 | |
| new | 14934 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2000000 | |
| s | 2000000 | |
| u | 1499882 | |
| r | 1499882 | |
| t | 1499882 | |
| C | 1000000 | |
| (space) | 1000000 | |
| o | 1000000 | |
| m | 1000000 | |
| n | 999764 | 6.2% |
| Other values (5) | 2499882 |
| Value | Count | Frequency (%) |
| e | 60000 | |
| s | 60000 | |
| t | 45066 | |
| r | 45066 | |
| u | 45066 | |
| n | 30132 | |
| (space) | 30000 | 6.2% |
| m | 30000 | 6.2% |
| o | 30000 | 6.2% |
| C | 30000 | 6.2% |
| Other values (5) | 75066 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15999292 |
| Value | Count | Frequency (%) |
| (unknown) | 480396 |
customer_zip_code
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 89999 | 25475 |
| Distinct (%) | 9.0% | 84.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 54993.64477 | 55064.10053 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 10000 | 10010 |
| Maximum | 99998 | 99997 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 10000 | 10010 |
| 5-th percentile | 14491 | 14558.8 |
| Q1 | 32477.75 | 32675.25 |
| median | 54966 | 54964.5 |
| Q3 | 77493 | 77524 |
| 95-th percentile | 95497 | 95528.25 |
| Maximum | 99998 | 99997 |
| Range | 89998 | 89987 |
| Interquartile range (IQR) | 45015.25 | 44848.75 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 25975.8078 | 25916.44864 |
| Coefficient of variation (CV) | 0.4723419934 | 0.4706596201 |
| Kurtosis | -1.199859176 | -1.188946149 |
| Mean | 54993.64477 | 55064.10053 |
| Median Absolute Deviation (MAD) | 22509 | 22420.5 |
| Skewness | 0.00079246458 | 0.00166168672 |
| Sum | 5.499364477 × 10¹⁰ | 1651923016 |
| Variance | 674742590.8 | 671662310.2 |
| Monotonicity | Not monotonic | Not monotonic |
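Several of the `customer_zip_code` statistics are derived from others in the same tables, so they can be recomputed as a consistency check. The following is a minimal sketch using the Full Dataset column; the row count of 1,000,000 comes from the report overview, and the variable names are illustrative.

```python
import math

# Values copied from the Full Dataset column of the customer_zip_code tables.
n_rows = 1_000_000
mean = 54993.64477
std = 25975.8078
q1, q3 = 32477.75, 77493
minimum, maximum = 10000, 99998

derived = {
    "range": maximum - minimum,  # Maximum - Minimum
    "iqr": q3 - q1,              # Q3 - Q1
    "cv": std / mean,            # coefficient of variation
    "variance": std ** 2,        # square of the standard deviation
    "sum": mean * n_rows,        # mean times number of observations
}

for name, value in derived.items():
    print(f"{name}: {value:.10g}")
```

The recomputed values should match the reported Range, IQR, CV, Variance, and Sum up to the rounding of the inputs.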
| Value | Count | Frequency (%) |
| 41138 | 27 | < 0.1% |
| 28225 | 27 | < 0.1% |
| 19719 | 27 | < 0.1% |
| 25427 | 27 | < 0.1% |
| 95120 | 26 | < 0.1% |
| 38515 | 26 | < 0.1% |
| 54735 | 26 | < 0.1% |
| 21109 | 26 | < 0.1% |
| 17611 | 25 | < 0.1% |
| 82394 | 25 | < 0.1% |
| Other values (89989) | 999738 |
| Value | Count | Frequency (%) |
| 85608 | 5 | < 0.1% |
| 31400 | 4 | < 0.1% |
| 91851 | 4 | < 0.1% |
| 46968 | 4 | < 0.1% |
| 69565 | 4 | < 0.1% |
| 92038 | 4 | < 0.1% |
| 35166 | 4 | < 0.1% |
| 29928 | 4 | < 0.1% |
| 53168 | 4 | < 0.1% |
| 13987 | 4 | < 0.1% |
| Other values (25465) | 29959 |
| Value | Count | Frequency (%) |
| 10000 | 12 | |
| 10001 | 14 | |
| 10002 | 6 | |
| 10003 | 12 | |
| 10004 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 10010 | 1 | |
| 10016 | 1 | |
| 10017 | 2 | |
| 10018 | 1 | |
| 10020 | 1 |
customer_city
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 6 | 6 |
| Min length | 6 | 6 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | City D | City D |
| 2nd row | City A | City D |
| 3rd row | City B | City C |
| 4th row | City A | City A |
| 5th row | City B | City A |
| Value | Count | Frequency (%) |
| city | 1000000 | |
| b | 250788 | 12.5% |
| c | 249955 | 12.5% |
| a | 249698 | 12.5% |
| d | 249559 | 12.5% |
| Value | Count | Frequency (%) |
| city | 30000 | |
| d | 7560 | 12.6% |
| b | 7532 | 12.6% |
| c | 7460 | 12.4% |
| a | 7448 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1249955 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| (space) | 1000000 | |
| B | 250788 | 4.2% |
| A | 249698 | 4.2% |
| D | 249559 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37460 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| (space) | 30000 | |
| D | 7560 | 4.2% |
| B | 7532 | 4.2% |
| A | 7448 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
customer_state
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | State Y | State Y |
| 2nd row | State X | State X |
| 3rd row | State X | State X |
| 4th row | State Y | State X |
| 5th row | State Z | State X |
| Value | Count | Frequency (%) |
| state | 1000000 | |
| z | 333674 | 16.7% |
| x | 333196 | 16.7% |
| y | 333130 | 16.7% |
| Value | Count | Frequency (%) |
| state | 30000 | |
| z | 10166 | 16.9% |
| x | 9984 | 16.6% |
| y | 9850 | 16.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| (space) | 1000000 | |
| Z | 333674 | 4.8% |
| X | 333196 | 4.8% |
| Y | 333130 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| (space) | 30000 | |
| Z | 10166 | 4.8% |
| X | 9984 | 4.8% |
| Y | 9850 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
store_zip_code
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 89999 | 25596 |
| Distinct (%) | 9.0% | 85.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 54972.76671 | 55034.19067 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 10000 | 10002 |
| Maximum | 99998 | 99986 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 10000 | 10002 |
| 5-th percentile | 14488 | 14519.8 |
| Q1 | 32473 | 32677 |
| median | 54961 | 55112 |
| Q3 | 77451 | 77636.5 |
| 95-th percentile | 95470 | 95479.05 |
| Maximum | 99998 | 99986 |
| Range | 89998 | 89984 |
| Interquartile range (IQR) | 44978 | 44959.5 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 25981.48314 | 25923.60613 |
| Coefficient of variation (CV) | 0.4726246229 | 0.4710454686 |
| Kurtosis | -1.200166165 | -1.19592475 |
| Mean | 54972.76671 | 55034.19067 |
| Median Absolute Deviation (MAD) | 22489.5 | 22474.5 |
| Skewness | -0.0001039626203 | 7.98280138 × 10⁻⁵ |
| Sum | 5.497276671 × 10¹⁰ | 1651025720 |
| Variance | 675037466.1 | 672033354.9 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 20956 | 28 | < 0.1% |
| 29159 | 27 | < 0.1% |
| 59836 | 26 | < 0.1% |
| 54696 | 26 | < 0.1% |
| 92910 | 26 | < 0.1% |
| 43369 | 26 | < 0.1% |
| 26386 | 26 | < 0.1% |
| 90024 | 26 | < 0.1% |
| 27134 | 26 | < 0.1% |
| 20477 | 26 | < 0.1% |
| Other values (89989) | 999737 |
| Value | Count | Frequency (%) |
| 68167 | 5 | < 0.1% |
| 60742 | 5 | < 0.1% |
| 38394 | 4 | < 0.1% |
| 55278 | 4 | < 0.1% |
| 36335 | 4 | < 0.1% |
| 13347 | 4 | < 0.1% |
| 90174 | 4 | < 0.1% |
| 38593 | 4 | < 0.1% |
| 88114 | 4 | < 0.1% |
| 57024 | 4 | < 0.1% |
| Other values (25586) | 29958 |
| Value | Count | Frequency (%) |
| 10000 | 11 | |
| 10001 | 6 | < 0.1% |
| 10002 | 15 | |
| 10003 | 8 | |
| 10004 | 14 |
| Value | Count | Frequency (%) |
| 10002 | 1 | < 0.1% |
| 10012 | 1 | < 0.1% |
| 10013 | 3 | |
| 10014 | 2 | |
| 10016 | 1 | < 0.1% |
store_city
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 6 | 6 |
| Min length | 6 | 6 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | City D | City D |
| 2nd row | City C | City B |
| 3rd row | City A | City D |
| 4th row | City B | City C |
| 5th row | City C | City D |
| Value | Count | Frequency (%) |
| city | 1000000 | |
| d | 250315 | 12.5% |
| c | 250177 | 12.5% |
| b | 249965 | 12.5% |
| a | 249543 | 12.5% |
| Value | Count | Frequency (%) |
| city | 30000 | |
| d | 7593 | 12.7% |
| b | 7546 | 12.6% |
| a | 7462 | 12.4% |
| c | 7399 | 12.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| (space) | 1000000 | |
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37399 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| (space) | 30000 | |
| D | 7593 | 4.2% |
| B | 7546 | 4.2% |
| A | 7462 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| (space) | 1000000 | |
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37399 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| (space) | 30000 | |
| D | 7593 | 4.2% |
| B | 7546 | 4.2% |
| A | 7462 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| (space) | 1000000 | |
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37399 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| (space) | 30000 | |
| D | 7593 | 4.2% |
| B | 7546 | 4.2% |
| A | 7462 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 1250177 | |
| i | 1000000 | |
| t | 1000000 | |
| y | 1000000 | |
| (space) | 1000000 | |
| D | 250315 | 4.2% |
| B | 249965 | 4.2% |
| A | 249543 | 4.2% |
| Value | Count | Frequency (%) |
| C | 37399 | |
| i | 30000 | |
| t | 30000 | |
| y | 30000 | |
| (space) | 30000 | |
| D | 7593 | 4.2% |
| B | 7546 | 4.2% |
| A | 7462 | 4.1% |
store_state
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 7 | 7 |
| Mean length | 7 | 7 |
| Min length | 7 | 7 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | State Y | State Y |
| 2nd row | State X | State Y |
| 3rd row | State Y | State X |
| 4th row | State Z | State Y |
| 5th row | State X | State Z |
| Value | Count | Frequency (%) |
| state | 1000000 | |
| x | 333702 | 16.7% |
| z | 333602 | 16.7% |
| y | 332696 | 16.6% |
| Value | Count | Frequency (%) |
| state | 30000 | |
| x | 10054 | 16.8% |
| y | 10012 | 16.7% |
| z | 9934 | 16.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| (space) | 1000000 | |
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| (space) | 30000 | |
| X | 10054 | 4.8% |
| Y | 10012 | 4.8% |
| Z | 9934 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| (space) | 1000000 | |
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| (space) | 30000 | |
| X | 10054 | 4.8% |
| Y | 10012 | 4.8% |
| Z | 9934 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| (space) | 1000000 | |
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| (space) | 30000 | |
| X | 10054 | 4.8% |
| Y | 10012 | 4.8% |
| Z | 9934 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000000 |
| Value | Count | Frequency (%) |
| (unknown) | 210000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 2000000 | |
| S | 1000000 | |
| a | 1000000 | |
| e | 1000000 | |
| (space) | 1000000 | |
| X | 333702 | 4.8% |
| Z | 333602 | 4.8% |
| Y | 332696 | 4.8% |
| Value | Count | Frequency (%) |
| t | 60000 | |
| S | 30000 | |
| a | 30000 | |
| e | 30000 | |
| (space) | 30000 | |
| X | 10054 | 4.8% |
| Y | 10012 | 4.8% |
| Z | 9934 | 4.7% |
distance_to_store
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 10001 | 9500 |
| Distinct (%) | 1.0% | 31.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.97910924 | 49.98382167 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 100 | 100 |
| Zeros | 62 | 3 |
| Zeros (%) | < 0.1% | < 0.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 5.03 | 5.14 |
| Q1 | 24.97 | 24.9075 |
| median | 49.96 | 49.99 |
| Q3 | 74.95 | 74.98 |
| 95-th percentile | 94.98 | 95.0305 |
| Maximum | 100 | 100 |
| Range | 100 | 100 |
| Interquartile range (IQR) | 49.98 | 50.0725 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 28.86098911 | 28.84972589 |
| Coefficient of variation (CV) | 0.5774610543 | 0.5771812744 |
| Kurtosis | -1.200199633 | -1.199578709 |
| Mean | 49.97910924 | 49.98382167 |
| Median Absolute Deviation (MAD) | 24.99 | 25.025 |
| Skewness | 0.001218286468 | 0.004047268387 |
| Sum | 49979109.24 | 1499514.65 |
| Variance | 832.9566927 | 832.3066839 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 99.05 | 139 | < 0.1% |
| 0.01 | 138 | < 0.1% |
| 9.68 | 138 | < 0.1% |
| 30.79 | 136 | < 0.1% |
| 31.27 | 135 | < 0.1% |
| 22.61 | 134 | < 0.1% |
| 40.84 | 134 | < 0.1% |
| 78.41 | 134 | < 0.1% |
| 82.37 | 133 | < 0.1% |
| 89.9 | 133 | < 0.1% |
| Other values (9991) | 998646 |
| Value | Count | Frequency (%) |
| 63.94 | 12 | < 0.1% |
| 10.03 | 10 | < 0.1% |
| 21.8 | 10 | < 0.1% |
| 86.61 | 10 | < 0.1% |
| 87.5 | 10 | < 0.1% |
| 26.41 | 10 | < 0.1% |
| 61.87 | 10 | < 0.1% |
| 73.07 | 9 | < 0.1% |
| 44.38 | 9 | < 0.1% |
| 39.03 | 9 | < 0.1% |
| Other values (9490) | 29901 |
| Value | Count | Frequency (%) |
| 0 | 62 | |
| 0.01 | 138 | |
| 0.02 | 88 | |
| 0.03 | 113 | |
| 0.04 | 86 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 0.01 | 5 | |
| 0.02 | 1 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 2 | < 0.1% |
holiday_season
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 3 | 3 |
| Mean length | 2.500214 | 2.505 |
| Min length | 2 | 2 |
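The mean length can be cross-checked against the value counts reported for this variable ("Yes" has 3 characters, "No" has 2). A minimal sketch using the counts listed further down in this section; the helper name `mean_length` is hypothetical and not part of the report:

```python
def mean_length(counts):
    """Average string length, weighted by how often each value occurs."""
    total_chars = sum(len(value) * n for value, n in counts.items())
    total_rows = sum(counts.values())
    return total_chars / total_rows

# Counts copied from the holiday_season value tables in this section.
full = mean_length({"Yes": 500214, "No": 499786})
sample = mean_length({"Yes": 15150, "No": 14850})
print(full, sample)
```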
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | No |
| 3rd row | Yes | No |
| 4th row | Yes | No |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| yes | 500214 | |
| no | 499786 |
| Value | Count | Frequency (%) |
| yes | 15150 | |
| no | 14850 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| Y | 15150 | |
| e | 15150 | |
| s | 15150 | |
| N | 14850 | |
| o | 14850 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 75150 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| Y | 15150 | |
| e | 15150 | |
| s | 15150 | |
| N | 14850 | |
| o | 14850 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 75150 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| Y | 15150 | |
| e | 15150 | |
| s | 15150 | |
| N | 14850 | |
| o | 14850 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2500214 |
| Value | Count | Frequency (%) |
| (unknown) | 75150 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| Y | 500214 | |
| e | 500214 | |
| s | 500214 | |
| N | 499786 | |
| o | 499786 |
| Value | Count | Frequency (%) |
| Y | 15150 | |
| e | 15150 | |
| s | 15150 | |
| N | 14850 | |
| o | 14850 |
season
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 5.500322 | 5.493333333 |
| Min length | 4 | 4 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Spring | Spring |
| 2nd row | Summer | Spring |
| 3rd row | Winter | Spring |
| 4th row | Winter | Winter |
| 5th row | Summer | Fall |
| Value | Count | Frequency (%) |
| winter | 250307 | |
| spring | 250169 | |
| fall | 249839 | |
| summer | 249685 |
| Value | Count | Frequency (%) |
| fall | 7600 | |
| winter | 7519 | |
| spring | 7492 | |
| summer | 7389 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22400 | |
| l | 15200 | |
| n | 15011 | |
| i | 15011 | |
| e | 14908 | |
| S | 14881 | |
| m | 14778 | |
| F | 7600 | 4.6% |
| a | 7600 | 4.6% |
| W | 7519 | 4.6% |
| Other values (4) | 29892 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164800 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22400 | |
| l | 15200 | |
| n | 15011 | |
| i | 15011 | |
| e | 14908 | |
| S | 14881 | |
| m | 14778 | |
| F | 7600 | 4.6% |
| a | 7600 | 4.6% |
| W | 7519 | 4.6% |
| Other values (4) | 29892 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164800 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22400 | |
| l | 15200 | |
| n | 15011 | |
| i | 15011 | |
| e | 14908 | |
| S | 14881 | |
| m | 14778 | |
| F | 7600 | 4.6% |
| a | 7600 | 4.6% |
| W | 7519 | 4.6% |
| Other values (4) | 29892 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5500322 |
| Value | Count | Frequency (%) |
| (unknown) | 164800 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 750161 | |
| i | 500476 | |
| n | 500476 | |
| e | 499992 | |
| S | 499854 | |
| l | 499678 | |
| m | 499370 | |
| W | 250307 | 4.6% |
| t | 250307 | 4.6% |
| p | 250169 | 4.5% |
| Other values (4) | 999532 |
| Value | Count | Frequency (%) |
| r | 22400 | |
| l | 15200 | |
| n | 15011 | |
| i | 15011 | |
| e | 14908 | |
| S | 14881 | |
| m | 14778 | |
| F | 7600 | 4.6% |
| a | 7600 | 4.6% |
| W | 7519 | 4.6% |
| Other values (4) | 29892 |
weekend
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 3 |
| Mean length | 2.499333 | 2.500333333 |
| Min length | 2 | 2 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | Yes | Yes |
| 2nd row | Yes | Yes |
| 3rd row | Yes | Yes |
| 4th row | No | No |
| 5th row | Yes | No |
| Value | Count | Frequency (%) |
| no | 500667 | |
| yes | 499333 |
| Value | Count | Frequency (%) |
| yes | 15010 | |
| no | 14990 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| Y | 15010 | |
| e | 15010 | |
| s | 15010 | |
| N | 14990 | |
| o | 14990 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 75010 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| Y | 15010 | |
| e | 15010 | |
| s | 15010 | |
| N | 14990 | |
| o | 14990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 75010 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| Y | 15010 | |
| e | 15010 | |
| s | 15010 | |
| N | 14990 | |
| o | 14990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499333 |
| Value | Count | Frequency (%) |
| (unknown) | 75010 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500667 | |
| o | 500667 | |
| Y | 499333 | |
| e | 499333 | |
| s | 499333 |
| Value | Count | Frequency (%) |
| Y | 15010 | |
| e | 15010 | |
| s | 15010 | |
| N | 14990 | |
| o | 14990 |
customer_support_calls
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 20 | 20 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 9.496269 | 9.496366667 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 19 | 19 |
| Zeros | 49755 | 1560 |
| Zeros (%) | 5.0% | 5.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 1 | 0 |
| Q1 | 4 | 4 |
| median | 9 | 10 |
| Q3 | 14 | 15 |
| 95-th percentile | 18 | 18 |
| Maximum | 19 | 19 |
| Range | 19 | 19 |
| Interquartile range (IQR) | 10 | 11 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 5.761232791 | 5.790587578 |
| Coefficient of variation (CV) | 0.606683824 | 0.6097687443 |
| Kurtosis | -1.204539564 | -1.215817005 |
| Mean | 9.496269 | 9.496366667 |
| Median Absolute Deviation (MAD) | 5 | 5 |
| Skewness | 0.001572025506 | -0.004931043499 |
| Sum | 9496269 | 284891 |
| Variance | 33.19180327 | 33.5309045 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 50608 | 5.1% |
| 8 | 50350 | 5.0% |
| 4 | 50334 | 5.0% |
| 12 | 50312 | 5.0% |
| 2 | 50158 | 5.0% |
| 11 | 50151 | 5.0% |
| 16 | 50087 | 5.0% |
| 13 | 50074 | 5.0% |
| 9 | 50053 | 5.0% |
| 7 | 50050 | 5.0% |
| Other values (10) | 497823 |
| Value | Count | Frequency (%) |
| 13 | 1567 | 5.2% |
| 0 | 1560 | 5.2% |
| 11 | 1558 | 5.2% |
| 8 | 1556 | 5.2% |
| 16 | 1551 | 5.2% |
| 17 | 1547 | 5.2% |
| 1 | 1536 | 5.1% |
| 5 | 1531 | 5.1% |
| 3 | 1519 | 5.1% |
| 18 | 1515 | 5.1% |
| Other values (10) | 14560 |
| Value | Count | Frequency (%) |
| 0 | 49755 | |
| 1 | 49530 | |
| 2 | 50158 | |
| 3 | 50608 | |
| 4 | 50334 |
| Value | Count | Frequency (%) |
| 0 | 1560 | |
| 1 | 1536 | |
| 2 | 1456 | |
| 3 | 1519 | |
| 4 | 1485 |
email_subscriptions
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 3 | 3 |
| Median length | 2 | 3 |
| Mean length | 2.499938 | 2.501266667 |
| Min length | 2 | 2 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | No | No |
| 2nd row | No | No |
| 3rd row | Yes | Yes |
| 4th row | No | No |
| 5th row | No | No |
| Value | Count | Frequency (%) |
| no | 500062 | |
| yes | 499938 |
| Value | Count | Frequency (%) |
| yes | 15038 | |
| no | 14962 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| Y | 15038 | |
| e | 15038 | |
| s | 15038 | |
| N | 14962 | |
| o | 14962 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 75038 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| Y | 15038 | |
| e | 15038 | |
| s | 15038 | |
| N | 14962 | |
| o | 14962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 75038 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| Y | 15038 | |
| e | 15038 | |
| s | 15038 | |
| N | 14962 | |
| o | 14962 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2499938 |
| Value | Count | Frequency (%) |
| (unknown) | 75038 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 500062 | |
| o | 500062 | |
| Y | 499938 | |
| e | 499938 | |
| s | 499938 |
| Value | Count | Frequency (%) |
| Y | 15038 | |
| e | 15038 | |
| s | 15038 | |
| N | 14962 | |
| o | 14962 |
app_usage
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.334299 | 4.3313 |
| Min length | 3 | 3 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | High | High |
| 2nd row | High | Low |
| 3rd row | Low | Medium |
| 4th row | Low | Low |
| 5th row | Medium | Medium |
| Value | Count | Frequency (%) |
| medium | 333822 | |
| low | 333345 | |
| high | 332833 |
| Value | Count | Frequency (%) |
| low | 10131 | |
| medium | 10035 | |
| high | 9834 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19869 | |
| L | 10131 | |
| w | 10131 | |
| o | 10131 | |
| M | 10035 | |
| e | 10035 | |
| d | 10035 | |
| u | 10035 | |
| m | 10035 | |
| H | 9834 | |
| Other values (2) | 19668 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129939 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19869 | |
| L | 10131 | |
| w | 10131 | |
| o | 10131 | |
| M | 10035 | |
| e | 10035 | |
| d | 10035 | |
| u | 10035 | |
| m | 10035 | |
| H | 9834 | |
| Other values (2) | 19668 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129939 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19869 | |
| L | 10131 | |
| w | 10131 | |
| o | 10131 | |
| M | 10035 | |
| e | 10035 | |
| d | 10035 | |
| u | 10035 | |
| m | 10035 | |
| H | 9834 | |
| Other values (2) | 19668 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4334299 |
| Value | Count | Frequency (%) |
| (unknown) | 129939 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 666655 | |
| M | 333822 | |
| e | 333822 | |
| d | 333822 | |
| u | 333822 | |
| m | 333822 | |
| L | 333345 | |
| o | 333345 | |
| w | 333345 | |
| H | 332833 | |
| Other values (2) | 665666 |
| Value | Count | Frequency (%) |
| i | 19869 | |
| L | 10131 | |
| w | 10131 | |
| o | 10131 | |
| M | 10035 | |
| e | 10035 | |
| d | 10035 | |
| u | 10035 | |
| m | 10035 | |
| H | 9834 | |
| Other values (2) | 19668 |
website_visits
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 100 | 100 |
| Distinct (%) | < 0.1% | 0.3% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 49.512951 | 49.32383333 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 99 | 99 |
| Zeros | 10111 | 296 |
| Zeros (%) | 1.0% | 1.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 4 | 5 |
| Q1 | 25 | 24 |
| median | 50 | 49 |
| Q3 | 75 | 74 |
| 95-th percentile | 95 | 94 |
| Maximum | 99 | 99 |
| Range | 99 | 99 |
| Interquartile range (IQR) | 50 | 50 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 28.86977699 | 28.83333737 |
| Coefficient of variation (CV) | 0.5830752644 | 0.584572111 |
| Kurtosis | -1.199464505 | -1.200055603 |
| Mean | 49.512951 | 49.32383333 |
| Median Absolute Deviation (MAD) | 25 | 25 |
| Skewness | -0.0006306812576 | 0.008063985979 |
| Sum | 49512951 | 1479715 |
| Variance | 833.4640237 | 831.361344 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 58 | 10304 | 1.0% |
| 95 | 10250 | 1.0% |
| 50 | 10235 | 1.0% |
| 62 | 10177 | 1.0% |
| 45 | 10175 | 1.0% |
| 13 | 10166 | 1.0% |
| 38 | 10160 | 1.0% |
| 84 | 10147 | 1.0% |
| 98 | 10136 | 1.0% |
| 93 | 10132 | 1.0% |
| Other values (90) | 898118 |
| Value | Count | Frequency (%) |
| 25 | 338 | 1.1% |
| 5 | 337 | 1.1% |
| 53 | 335 | 1.1% |
| 12 | 332 | 1.1% |
| 85 | 331 | 1.1% |
| 46 | 330 | 1.1% |
| 44 | 328 | 1.1% |
| 8 | 324 | 1.1% |
| 76 | 324 | 1.1% |
| 4 | 323 | 1.1% |
| Other values (90) | 26698 |
| Value | Count | Frequency (%) |
| 0 | 10111 | |
| 1 | 9997 | |
| 2 | 9933 | |
| 3 | 10007 | |
| 4 | 9969 |
| Value | Count | Frequency (%) |
| 0 | 296 | |
| 1 | 274 | |
| 2 | 276 | |
| 3 | 304 | |
| 4 | 323 |
social_media_engagement
Text
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Length
| | Full Dataset | Systematic Sample |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 4 | 4 |
| Mean length | 4.332057 | 4.343133333 |
| Min length | 3 | 3 |
Unique
| | Full Dataset | Systematic Sample |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Full Dataset | Systematic Sample |
|---|---|---|
| 1st row | High | High |
| 2nd row | Medium | High |
| 3rd row | Medium | Low |
| 4th row | Low | Low |
| 5th row | Low | Medium |
| Value | Count | Frequency (%) |
| low | 334073 | |
| medium | 333065 | |
| high | 332862 |
| Value | Count | Frequency (%) |
| medium | 10128 | |
| low | 9962 | |
| high | 9910 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 20038 | |
| M | 10128 | |
| e | 10128 | |
| d | 10128 | |
| u | 10128 | |
| m | 10128 | |
| L | 9962 | |
| o | 9962 | |
| w | 9962 | |
| H | 9910 | |
| Other values (2) | 19820 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 130294 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 20038 | |
| M | 10128 | |
| e | 10128 | |
| d | 10128 | |
| u | 10128 | |
| m | 10128 | |
| L | 9962 | |
| o | 9962 | |
| w | 9962 | |
| H | 9910 | |
| Other values (2) | 19820 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 130294 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 20038 | |
| M | 10128 | |
| e | 10128 | |
| d | 10128 | |
| u | 10128 | |
| m | 10128 | |
| L | 9962 | |
| o | 9962 | |
| w | 9962 | |
| H | 9910 | |
| Other values (2) | 19820 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4332057 |
| Value | Count | Frequency (%) |
| (unknown) | 130294 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 665927 | |
| L | 334073 | |
| w | 334073 | |
| o | 334073 | |
| M | 333065 | |
| e | 333065 | |
| d | 333065 | |
| u | 333065 | |
| m | 333065 | |
| H | 332862 | |
| Other values (2) | 665724 |
| Value | Count | Frequency (%) |
| i | 20038 | |
| M | 10128 | |
| e | 10128 | |
| d | 10128 | |
| u | 10128 | |
| m | 10128 | |
| L | 9962 | |
| o | 9962 | |
| w | 9962 | |
| H | 9910 | |
| Other values (2) | 19820 |
days_since_last_purchase
Real number (ℝ)
| | Full Dataset | Systematic Sample |
|---|---|---|
| Distinct | 365 | 365 |
| Distinct (%) | < 0.1% | 1.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 182.027559 | 181.6553333 |
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 364 | 364 |
| Zeros | 2768 | 98 |
| Zeros (%) | 0.3% | 0.3% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 7.6 MiB | 234.5 KiB |
Quantile statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 18 | 18 |
| Q1 | 91 | 91 |
| median | 182 | 181 |
| Q3 | 273 | 273 |
| 95-th percentile | 346 | 346 |
| Maximum | 364 | 364 |
| Range | 364 | 364 |
| Interquartile range (IQR) | 182 | 182 |
Descriptive statistics
| | Full Dataset | Systematic Sample |
|---|---|---|
| Standard deviation | 105.3645979 | 105.2680916 |
| Coefficient of variation (CV) | 0.5788387123 | 0.5794935371 |
| Kurtosis | -1.199912738 | -1.197007882 |
| Mean | 182.027559 | 181.6553333 |
| Median Absolute Deviation (MAD) | 91 | 91 |
| Skewness | -0.0005543132091 | 0.005172950866 |
| Sum | 182027559 | 5449660 |
| Variance | 11101.69848 | 11081.37112 |
| Monotonicity | Not monotonic | Not monotonic |
| Value | Count | Frequency (%) |
| 53 | 2916 | 0.3% |
| 72 | 2890 | 0.3% |
| 98 | 2888 | 0.3% |
| 252 | 2869 | 0.3% |
| 364 | 2867 | 0.3% |
| 6 | 2862 | 0.3% |
| 325 | 2857 | 0.3% |
| 136 | 2843 | 0.3% |
| 267 | 2833 | 0.3% |
| 239 | 2832 | 0.3% |
| Other values (355) | 971343 |
| Value | Count | Frequency (%) |
| 180 | 110 | 0.4% |
| 46 | 110 | 0.4% |
| 147 | 107 | 0.4% |
| 224 | 105 | 0.4% |
| 75 | 105 | 0.4% |
| 32 | 104 | 0.3% |
| 293 | 103 | 0.3% |
| 299 | 102 | 0.3% |
| 266 | 102 | 0.3% |
| 308 | 101 | 0.3% |
| Other values (355) | 28951 |
| Value | Count | Frequency (%) |
| 0 | 2768 | |
| 1 | 2752 | |
| 2 | 2701 | |
| 3 | 2709 | |
| 4 | 2786 |
| Value | Count | Frequency (%) |
| 0 | 98 | |
| 1 | 86 | |
| 2 | 88 | |
| 3 | 83 | |
| 4 | 69 |